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i.-- ,01 DNA molecules in cells using homologous recombination 
(57) Abstract: Disclosed herein are methods for S^^^^^^y homologous recombination in bacterial cells, 
mediated by recombinases and similar proteins. The methods P»"»««h«J the generation of tmnsgenic and knockout 

Td n euSryotic cells such as mammalian cells. The ^ h ^^~^! A faints without the need for restriction 
anfmals, Xne replacement. The methods are also useful J-I^ ^ J Qr creating spedfic Rations in 

enyzmes. The methods are also useful tor repairing «^° r ™ J ^ h ^ fficie „c y homologous recombination, 
the genome. Also disclosed are bactenal strams which are useful n.g 
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ENHANCED HOMOLOGOUS RECOMBINATION MEDIATED BY LAMBDA 

RECOMBINATION PROTEINS 
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FIELD 

The present disclosure relates to methods to enhance homologous recombination 

inbacteriaandeukaryo^^ . 
from bacteriophage lambda. It also relates to methods for modifying genomic DNA m 
bacterial artificial chromosomes (BACs) and to subcloning of genomic DNA from 
BACs into multicopy plasmids. 

BACKGROUND OF THE INVENTION 

Concerted use of restriction endonucleases and DNA ligases allows in vitro 
recombination of DNA sequences. The recombinant DNA generated by restriction and 
ligation may be amplified in an appropriate microorganism such as E. coli, and used for 
diverse purposes including gene therapy. However, the restriction-ligation approach 
has two practical limitations: first, DNA molecules can be precisely combined only if 
convenient restriction sites are available; second, because useful restriction sites often 
repeat in a long stretch of DNA, the size of DNA fragments that can be manipulated are 
limited, usually to less than about 25 kilobases. 

Homologous recombination, generally defined as an exchange between 
homologous segments anywhere along a length of two DNA molecules, provtdes tat 
altemafivemethodforengineeringDNA. In generating recombinant DNA with 
homologous recombination, a microorganism such aa K coli, or a euharyotic eel! such 
as a yeas, or vertebrate cell, is transtamed with exogenous DNA. The center of the 
exogenous DNA contains the desired transgene, whereas eaoh flank contains a segment 
of homology with the cell's DNA. The exogenous DNA is unreduced into fire cell wrth 
standard techniques such as electroporation or calcium phosphate-medtated 
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, • ■ ♦ r-iv« DNA for example with the assistance of 
transfection, and recombines into the cell s DMA, tor e* v 

recombination-promoting proteins in the cell. 

In generating recombinant DNA by homologous recombination, it is often 
advantageous to work with short linear segments of DNA. For example, a mutation 
may be introduced into a linear segment of DNA using polymerase chain reaction 
(PCR) technics. Under proper circumstances, the mutation may then be mtroduced 
L cellular DNA by homolo g ous recombination. Such short hnear DNA segments can 
.ansformyeast.butsubse.uentmampulationofrecombinantDNA^ 

r eadilytransformbac» 

Accordingly, recombinants are rare, require special poorly-growing strain, (such a 
RecBCD- mutant strams) and generally require thousands of base pan. of homology. 
Thus, improved methods of promoting homologous recombination in bactena are 
needed. 

, toeukaryottc cells, targeted homologous combination provides a basis for 

ttB enng and altering essentiaUy any desued seuueuce in a duplex DNA molecule, such 
33 Igehng a DNA sconce m a cbromoaomo for rep— by another scenes. 
The approach may be useful for hooting human genehc d.seases. 

Homologous recombination has been used .o create knock-out — « and 

gene funonom Transgenic - are organisms ma, confain stably tntegrated jes 
of genes or gene constructs derived horn anomor species in the chromosome of the 
^genic animal These animals can be generated by introducing ^ _ 
JL of the foreign genes into totipotent cells by a variety of methods, moludmg 

25 homologous recombination. 

Curreuttymethod.forproducmghuusgeuioshavobeeuperfonnedonbo.ipo.en, 

e^oric stem cells (ES) and wifh feruhzed zygotes. ES ceUs have an advantage m 
ft* targe numbers of cells can be maculated in viho before Mr are used to gene^ 
panics. Alternatively, DNA can also be induced into fertilized oocytes by 
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, • injected into the germline of organisms including C. 
micro-injection into pronuclei, or injected into g 

elegans or Drosophila species. 

j ' i ~a tn Hetect and/or select for targeted site- 
Several methods have been developed to detect ano/or 
Several met tnr DNA md the target homologous chromosomal 

seque nce(Capecchi,Sci^ 

afte ri^vdirectselecuononmeappropriategro^ 

t ^ - or under promoter control, and successM transfection can he 
incorporated into a vecior mm v igom 

b, seiecling 04. — **- - ~ ^^les an, 
„ o to se,ecnon P— - — - £ ^ ^ 85: 

8583, 1988; Dorm. * al. Science ^ ^ 

1991 , and Snouwaert et al. Scrence , 

mu st be transfected, selected, and screened m order gene 

homologous recombinant. 
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SUMMARY OF THE DISCLOSURE 

„ • e DNAencodinglambdarecombinases operably linked to a de-repressible 
havmg DNA encoding ^moter is activated, by 

^.forex^pledrelambdaPLpronrorer. IhePLpron , 

^e shift, inereoy fading «o exp^on o acids 

The nucleic acids undergoing recomtnnatron may be inrra 
^ach—al, for example in abacterial artificial chromosome. 
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Tfce present disclose also provides method, for inducing homologous 
recombination using single-stiunded DNA molecules, hy inking £ DHA 

p ^vpeptide capable of promoting homologous recombiuatiou. Such 
5 J bindmg polypeptides hrclude hunbda Beta, KecT, P22 Erf, and Rad5 , as we,, - 
^cdonmfagm^andvariantsofsingle-^DNAbmdingpolypephdes 

The present disclosure ate, provides bacteria! cells that promote efficmn, 
homologous recombination. These bacteria, ceils contain one or more genes or 
pr „mo,ers from a defective .ambda prophage within me bacteria, chromosome. 
,0 The disclosure ate, provides methods for altering eukaryotic genes by 

expressing recombinases opemb.y hnxed to a de-repressible promoter in bactena, ce Is 
Znl genes thus modified can he used to modify euharyotic ceUs, for examp>e ,o 
generate transgenic or knockout animals. 
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BRIEF DESCRIPTION OF THE FIGURES 

F5g 1 is a linear depiction of the defective lambda prophage as it is integrated 
onme*. coli chromosome. Prophage genes are mdicated by the solidline, and* coU 
genes by broken lines. The coding regions for the lambda recombinan ts „. 
and J are approximately in the center of the defective prophage. Lambda genes era 
Jgh deleted from me defective prophage, are enclosed withm parentheses 
together with the E. coli bio A gene to indicate their deletion. 

Fig 2A is a schematic diagram showing classical recombinant technology, and 
Fig. 2B is a schematic diagram showing "recombineering" nsing homologous 

recombination, as disclosed herein. 

Fig Jisagmphfta.showsfineoffeotofinducfiouumeofmel.mbdaproteina 

Beta Exo and Gam on recombination efficiency. A linear chloramphenicol resistance 
J*to L uaed to target prophage genes, and was electioporaten into cells after the 
oefts had been heated * 42 °C for Uae time indicated. Time of induction (tempemture 
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shift to 42 °C) is plotted on the x-axis, against number of chloramphenicol-resistant 
recombinants obtained on the y-axis. 

Fig. 4 is a graph that shows the effect of linear DNA amount on recombination. 
The£ Coli strain DY330 was induced at 42 °C for 15 minutes, and DNA encoding a 
linear chloramphenicol resistance cassette (<cat>) at the indicated concentration was 
electroporated into cells. DNA concentration encoding <cat> is plotted on the x-ax 1S , 
against number of chloramphenicol-resistant recombinants obtained on the y-axis. 

Fig. 5 is a graph which shows the effect of homologous arm length on 
recombination in E. coli cells induced for expression of lambda recombination proteins 
Beta Exo,andGam. A chloramphenicol resistance cassette was synthesized with 
homologous arms of the indicated length at its 5' and V end. Homologous arm length 
was varied from 0 to 1,000 base pairs. Length of homologous arms is plotted on the x- 
axis, against number of chloramphenicol-resistant recombinants (log scale) on the y- 
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axis. 



Fig. 6 is a linear depiction of the modified defective lambda prophage as 

. ,• u „~„„fnv™n "FL250 and EL350 cells. This figure 

integrated on the E. coli chromosome of DY38U, HL^du, anu du 

illustrates that the defective prophages used for BAC engineering contain the X genes 
from ci857 to int. P L and P R denote the lambda left and right promoters, respectively. 
The gam and red genes, exo and bet are under the control of Pu which is repressed by 
the temperature-sensitive repressor, cI857 at 32°C and derepressed at 42°C. tet replaces 
the segment from cro-biok in DY380 cells. The araCY^pe cassette or the araC- 
P BAD cr e cassette replaces the segment from cro-biok in EL250 or EL350 cells, 
respectively. The promoter of the araBAD operon (Pbad), which can be induced by L- 
arabinose, controls the expression of the J** or ore genes. Thick black lines designate 
the prophage while thin lines represent E. coli sequence. < > defines the ends of the 
cro-bioA region that was replaced with tet, araC-Y^flpe, or amC-P B AD ere. 

Fig. 7 illustrates a strategy for BAC engineering. This figure illustrates the 
relative position of the Eno2 gene in the fully sequenced 250-kbp BAC, 284H12 and the 
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n \^tV.e»1a<?texon of £no2. In the targeting 
different steps used to introduce Cre into the last exon on* 

cassette FRT sites are denoted by ellipses, the kan gene by a red rectangle and the 
GFPcre fusion gene by a blue rectangle. The green boxes represent Eno2 exons. 

Fig 8 is a schematic diagram illustrating the use of the gap repair to 
subclone fragments as large as 80 k b P from BAG, Fig. 8A shows as short thick black 
^ws the location of the 5' homologies ontiie amplification primers used to amphfy 
PBR322 for subcloning by gap repair. Each primer also contains 20 nt segments at its 
3- endtoprime P BR322. ** and ** cleavage sites were included in these pnmers* 
facilitate release of the subcloned fragments from the plasmid backbone. The loca on 
of Spel restriction sites near Eno2 is also shown ("S"). Spel restriction sites are not 
, reS entonthelmearampli fi ed P BR322vecto, Fig. 8B shows an intermediate ,ep in 
gap repair, pairing between a typical amplified P BR322 targeting cassette and the 
modifiedWBAC. Ap, amp resistance gene; OKI, origin of rephcahon 

Fig 9 is a schematic diagram of a defective I prophage. The defective 
prophage DY380 expressing the , Exo, Beta, and Gam functions is shown w,th the 
genes under ft promoter control and the temperature sensitive repressor, CIS 57. 
Advantages and disadvantages of the systems are described. The genes encoding Cr 

. • ^ Twien H3T ?S0 and EL350) and replace the 
and Flpe are present on other derivatives of DY380 (EL250 ana , 

tet gene as shown. 

Fig 10 i saschemali C diagra I nofmv.v 0 c 1 ontogbyrcco m btaa tl cn«3 1 nggap 
repair of a line, vector plasmid The memod of » - Cooing uses ». B- ^ 
a voc.0, and a target DNA, that have homologies * eaoh other a. then ends. Bo* are 
eUCroporated inro oompolent cehs to allow recombination and gap repair of the 

piasmid. The hnosx vector is made in a similar way «o<ha, shown mftg. «. 
5 ng.uiaasehemattomagramofmv^renievmgofDNAfromBACelone, 

Retrieving of segments up ,o 80 k bp from BACa into PCR-ampUEed veo»rstasb=en 
poasibie naingreoombineeringteehniquesdiseloaedherein. Hereoniyme asmrdta 
taarized and transformed into a reoombination eompeten, eel! oontatmng ft, , BAG 
Recombination oooora between homologiea on me end of me linear veotor and me 
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BAC This method eUminates standard cloning technology from theBAC,and 
hnportantly, the cloned segment is never replicated ft, vitro, thereby reducing the chance 

of extraneous changes in the sequence. 

Fig-Kiaaschematicdtagramofamini-lambdaDNAcircl.. Thislambda 

DNA dement is no. a plasmid and lacks any replication origin activity. It does eontaur 
a , lambda clS 57 repressor and thepi operon that the repressor controls. It also 
contains a cassette encoding a drug marker, in this case the tei genes. This DNA when 
ttansfbrmed into most attains including the BAC attains makes In. protein aHownrg 
integration of tite circular DNA a. me X attachment site on fte bacterial chromosome 
and CI857 repressor to allow repression of pL. The in.egra.ed mini-lambda is stable bu« 
able to be induced a. 42«C to activate Gam, Bete, and Bxo expression to make me cell 

recombination competent. 

Fig 13 is a schematic diagram of recombination of ssDNA into the genome. 
When ssDNA is electroporated into cells, it is bound by Beta protein and recombmed 
into the genome or into a BAC plasmid by homology. Evidence suggests that Beta- 
bound ssDNA anneals to its ssDNA complement at the replication fork. The strand of 
DNA corresponding to that made by lagging strand synthesis is most recombmogemc 
suggesting that Beta simply anneals the ssDNA to a gap caused by DNA replication. 

DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS 

There exists a need in the art for methods of precisely and efficiently altering 
predetermined endogenous genetic sequences by homologous recombination in vivo. 
There independently exists a need in the art for high-efficiency gene targeting, so as to 
avoid complex in vitro or in vivo selection protocols. 

25 One or more of the foregoing needs axe met by the methods disclosed herein, 

which include methods for cloning DNA molecules in cells using homologous 
recombination mediated by lambda recombinases and similar proteins. 

One such method uses a cell having DNA encoding functional Exo, Beta, and 
Gam or fragments or variants thereof, operably linked to the a de-re P ressible promoter 
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(e a the PL promoter). De-repression of the de-repressible promoter (e.g. the induction 
of transcription from the pL promoter by inactivation of d) induces expression of exo, 
bet and gam, and in some embodiments may be selectively activated for this purpose. 
A nucleic acid (such as a polynucleotide which is homologous to a target DNA 
sequence) capable of undergoing homologous recombination is introduced into the cell, 
and cells in which homologous recombination has occurred are either selected or found 
by direct screening of cells. In particular embodiments, the nucleic acid introduced mto 
the cell may be double strand DNA, or DNA with 5' overhangs. 

In additional particular embodiments, at least in 1 in 5000 cells contain DNA in 
which homologous recombination has occurred. In further embodiments, at least 1 in 
1,000 cells, or 1 in 500 cells, 1 in 100 cells, or 1 in 20 cells contain DNA in winch 
homologous recombination has occurred. 

The cell may be a eukaryotic cell, or a prokaryotic cell, such as a bacterial cell, 
for example an E. coli strain. The DNA encoding the lambda recombination proteins 
l5 andPLpromotermaybeintrachromosomalorextrachromosomal. Similarly, the target 
DNA sequence maybe intrachromosomal or extrachromosomal; for example, the target 
DNA sequence maybe found in a chromosome of the cell or aplasmid (including 
derivatives of coffil, pSClOl, P 15A and shuttle vectors that replicate inbothbactena 
and eucaryotic cells), bacterial artificial chromosome, PI artificial chromosome, yeast 
20 artificial chromosome, cosmid or the like. 

In additional particular embodiments, the nucleic acid introduced into the cell 
may be double-stranded DNA or DNA with a 5' overhang, and may include a positive 
or negative selectable marker. The introduced nucleic acid may alter the function of a 
nucleic acid sequence such as a gene in the cell, or add a gene to the DNA of the cell. 
The cell may be treated to enhance macromolecular uptake, for example using 
electroporation, calcium phosphate-DNA coprecipitation, liposome mediated transfer, 
or other suitable methods, to other particular embodiments, the method may produce 
homologous recombination that alters the function of a gene in the cell, or adds a gene 
to the cell. 
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In further particular embodiments, the cell may be treated to enhance 
macromolecular uptake, for example with electroporation, calcium phosphate-DNA co- 
precipitation, DEAE-dextran-mediated transfection, polybrene-mediated transfection, 
microinjection, liposome fusion, lipofection, protoplast fusion, inactivated adenovirus- 
5 mediated transfer, HVJ-liposome mediated transfer, and biolistics. 

Another such method that meets one or more of the above-mentioned needs 
includes methods which introduce into the cell DNA capable of undergoing 
homologous recombination and a ssDNA binding polypeptide capable of promoting 
homologous recombination. In particular embodiments, the DNA is ssDNA or DNA 
10 having 3' overhangs. 

The single stranded DNA (ssDNA) binding polypeptide is a type of DNA 
binding polypeptide which mediates double strand break repair homologous 
recombination by binding to ssDNA or a 3 ' overhang in dsDNA and promoting 
recombination. It promotes recombination by annealing the bound ssDNA to its 
15 complement in thecell. Examples of such ssDNA binding polypeptide include lambda 
Beta,*. CO / I RecT,Erfofbacteri 0 phageP22,andRad52. The ssDNA binding 
polypeptide may be introduced as a nucleic acid. For example, a nucleic acid that 
expressesthessDNAbindingpolypeptideisintroducedintoa cell, such as a eukaryotic 

cell. Expression of the ssDNA binding polypeptide from a nucleic acid maybe 
20 induced, for example, by activation of an inducible promoter. Inparticular 

embodiments, the nucleic acid may further include lambda exo and gam, and the 
inducible promoter may be the lambda PL promoter. In other embodiments, the ssDNA 
binding polypeptide is introduced into the cell as a polypeptide. 

The cell used in methods disclosed herein may be a bacterial cell such as an E. 
25 coli strain, or a eukaryotic cell such as a mammalian cell, a stem cell, or virtually any 
other eukaryotic cell type. The DNA used in the method may be a single 
oligonucleotide sequence, or may be two or more overlapping sequences, for example 
with more than 10, or more than 20 base pairs of complementary overlap at either their 
5' or 3' termini (in specific examples of the 5' case, the nucleic acid includes exo and bet 
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nucleic acid seances,. The DNA may compnse a se.ec.able matker, and homologous 
recombination with die ssDNA may confer a selectable ph.no.ype npon the cell. In 
particular embodiments, the cell may be treated da enhaaaee macromotalar uptake, 
such as with elecfroporation, ca.ciumpbospha.e-MA coprecipitadon, liposome 
5 mediated transfer, or otber suitable methods. The effee, of homologous reeombmahon 
.nay be to alter the thnedon of a gene in the eell, or add a gene to the oell. 

m particmar examples, me ssDNA is nsed in an amonn. of about 0.01 pM to 
abon, 10 * or from abon, 0.1 pM to about , mM; or from about 1 pM to about 100 
pM in other exan*.es, me ssDNA binding polypeptide is used ha an amount of 0.001 
, 0 p to about 0.0. pM, or from about 0.01 pM to about .0 mM; or horn about 0.1 pM to 
about 1 mM; or from about 1 pM to about 100 pM. 

Aiso disehased are baeterial cells that may be useful in practicing dae disclosed 
m e*ods. These include bacteria! cells harboring a defective lambda prophage of 
genotype XoI857 A(cro-bioA). InparticuUr examples, dae bacteria, cells ma> .have a 
selectable mariter, sueh aa an antibiotic resistance marker, uosfream of dhec 857 g«ae 
fc some particular examples, dae diselosed bacterial ceUa inelnde an mducrble promoter 
upsfream of dae cI857 gene, whioh may he operably connected to a gene eneodmg a 
K combmase, anoh aa Op, flpe, or ere, or a gene encoding functional fragments or 
™*ds of these reoombinaae, In odter particular examples, tiae haetenal cells may 
20 contain a bacterial artificial chromosome, which may have a selectable marker, tap, 
and/or FRT sites, ha particular «nbodimen«, tire selectable marker on dae bactena, 
rtficial chromosome is excisable by a reoombinaae. In oter particular example ft. 
baeterial artificial chromosome may have at least one exon or at least one infron of a 
mammalian gene. 

25 The disclosure also includes methods of altering eukaryotic genea by expressing 

ta a bacterial cell an mfrachromosomaa gene encoding a recomhmase operably linked to 
a PL promoter. The bacterial cell also includes an exfractaomosoma. eukaryotic gene 
or gene fragment (having at least one infron or a. leaat one exon of a enkaryotic gene). 
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A nucleic acid capable of undergoing homologous recombination with the eukaryouc 
gene is introduced into the bacterial cell, and the nucleic acid undergoes homologous 
recombination with the eukaryotic gene or gene fragment. In a particular embodiment, 
the nucleic acid undergoes homologous recombination with a targeting frequency of at 
5 least about 1 in 1,000. 

m one embodiment, the expressed recombinase is a double strand break repair 
recombinase, such as lambda Beta or other single-stranded DNA binding proteur, 
lambda Exo, or lambda Gam. In another embodiment, the extrachromosomal 
eukaryotic gene or gene fragment may be located on a bacterial artificial chromosome, 
10 yeast artificial chromosome, PI artificial chromosome, plasmid or cosnud. In yet 
another embodiment, the eukaryotic gene or gene fragment is derived from a 
mammalian organism, such as a mouse or human. 

In several additional embodiments, the nucleic acid undergoing homologous 
recombination may encode a recombinase, functional fragments or variants of a 
15 recombinase, or an epitope tag. 

Also disclosed are methods of altering intrachromosomal DNA of a eukaryotic 
cell In these methods, an altered eukaryotic gene or gene fragment is introduced mto 
the eukaryotic cell. The introduced eukaryotic gene or gene fragment has been altered 
by homologous recombination using the methods of this disclosure. 
20 For example, extrachromosomal DNA including the eukaryotic gene or gene 

fragment is introduced into a bacterial cell having an intrachromosomal gene encoding a 
recombinase operably linked to a de-repressible promoter. The bacterial cell » then 
induced to express the recombinase. A nucleic acid molecule capable of undergoing 
homologous recombination with the eukaryotic gene or gene fragment is introduced 
25 into the bacterial cell. The eukaryotic gene or gene fragment undergoes homologous 
recombination with the nucleic acid, and altered eukaryotic gene or gene fragment may 
then be isolated and introduced into a eukaryotic cell. 
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In one embodiment, the eukaryotic gene or gene fragment introduced into the 
eukaryotic cell is located on a bacterial artificial chromosome. The eukaryotic gene or 
gene fragment is capable of undergoing homologous recombination with a target gene 
in the cell, thereby altering the nucleic acid sequence of the eukaryotic cell's 
5 intrachromosomal DNA. In specific, non-limiting examples, the eukaryotic cell is a 
mammalian cell, an embryonic stem cell, or a zygote. 

Also disclosed are mutant mammals which have had one or more of their genes 
altered by homologous recombination with a bacterial artificial chromosome carrying a 
eukaryotic gene or gene fragment that has been altered by the disclosed methods. The 
L 0 gene alteration can introduce a recombinase into the mutant mammal, such as a site- 
specific recombinase. 

A mobilizable lambda DNA is also disclosed herein that is isolated as a mini- 
lambda prophage. ThemobilizablelambdaDNAcanbetransformedintoanybacterial 
strain of interest. The lambda DNA integrates into the bacterial chromosome to 
15 generate a defective prophage that expresses the recombinase. 

The present disclosures provide methods of enhancing the efficiency of 
homologous recombination. The disclosures will be better understood by reference to 
the following explanation of terms used and detailed description of methods for 
carrying out the invention. 

« 

Unless otherwise noted, technical terms are used according to conventional 
usage Definitions of common terms in molecular biology may be found in Benjamin 
Lewin, Genes V, published by Oxford University Press, 1994 (ISBN 0-19-854287-9); 
Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell 
Science Ltd., 1994 (ISBN 0-632-02182-9); and Robert A. Meyers (ed.), Molecular 
Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH 
Publishers, Inc., 1995 (ISBN 1-56081-569-8). 
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Explanation of terms used 

3' overhang: two nucleic acid sequences which when annealed are partially 
double-stranded and partially single-stranded. The single-stranded end or ends extend 
away from the double-stranded segment in a 5' to 3' direction. 
5 5' overhang: two nucleic acid sequences which when annealed are partially 

double-stranded and partially single-stranded. The single-stranded end or ends extend 
away from the double-stranded segment in a 3' to 5' direction. 

Arabinose: A simple 5-carbon sugar metabolized by E. coli. In one 
embodiment, it is used as a chemical to inactiveate repression and to induce and activate 
10 expression from the promoter pBAD. 

Attachment Site (att): A site specific site for recombination that occurs on 
either a phage or a chromosome. An attachment site on lambda is termed "attP", wmle 
anattachmentsiteofabacterialchromosomeis-attB." Integrase mediated 
recombination of an attP site with an attB site leads to integration of the X prophage m 
15 the bacterial chromosome. 

Bacterial artificial chromosome (BAC): bacterial artificial chromosomes 
(B ACs) have been constructed to allow the cloning of large DNA fragments in E. coli, 
. as described in O'Conner et al., Science 244: 1307-12, 1989; Shizuya et al., Proc Natl. 
Acad Sci. USA 89: 8794-7, 1992; Hosoda et al., Nucleic Acids Res. 18: 3863-9, 1990; 
and Ausubel et al. This system is capable of stably propagating mammalian DNA over 
300 kb In one embodiment, a BAC carries the F replication and partitioning systems 
that ensure low copy number and faithful segregation of plasmid DNA to daughter cells. 
Large genomic fragments can be cloned into F-type plasmids, making them of use in 
constructing genomic libraries. 

Beta: the 28 kDa lambda Beta ssDNA binding polypeptide (and nucleic acid 
encoding lambda beta) involved in double strand break repair homologous 
recombination. DNA encodingBeta (bet) and polypeptide chains having lambdaBeta 
activity are also referred to herein as bet. See Examples 1 and 14 and references therein 
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fa discussion The lambda Beta protein binds to standee DNA and promotes 
for discussion. „j rtt , cn fTiNA See Karakousis et al, J . 

saturation of complementary single strand regions o S 
MbL Biol. 276: 721-733, 1998; Li et al., J. MoL Biol. 276. 721-733, 1998, 
PNAS 96: 4279-4284, 1999. 

* ■ ♦ «f«Pta include those variants that maintain 
Functional fragments and variants of Beta includetn 

as described herein, and in the publications referenced herein. It recog^ 

ntin. Beta may be considerably mutated without materially altermg the 
gene encoding Beta may be co v^tinn function of lambda Beta. First, 

qsDNA binding function or homologous recombmation function 
ssUNA Dinning Afferent codons encode the 

0 the .enetic code is well-known to be degenerate, and thus different co 

•„ Second even where an amino acid mutation is introduced, the 
same amino acids. Second, even wner 

nation may be conservative and have no matena, tmpac, on the e»nh 
^bdaBeta. SeeStiyer,Bioc„emishy3rdEd.,(c,19 8 S. 
Beta polypeptide chin may be deleted withoo, hnpairing or ehmrnaung ,.a ssDNA 
Betapotypepu fimction Fourth, insertions or additions 

„ bindingprotehr taction, or its recombmahon J ' ugs . 

^bemadeinthelambdaBeUpolypeptidecham-forex^ple, d .* 

.lontimpairhrgoreHntinatingitaessentialnmotion, Ausube, e. al, .997. 

• f-rixr a iMn c e ii s using DNA-coated micro-projecnles. 
THolistics: Insertion of DNA mto cens using ^ 

20 ^described and delined ha U.S. Pat No. 4,945,0,0, which is here* meowed 



by reference. 



:ence. 

. ™v av A mece of DNA lacking internal, non-coding 
cDNA (complementary DNA), Aptece * c£ „ 

sclents (introns) and regulatory sa,uences that detennin P 

be synthesized in the laboratory revere transcription from messenger 

25 from cells. 

Cosmld: Artificially constructed cloning vector containing the cos gene of 
ph^^Coantidacanb.paekagedinUmbdapn^partic.^n.eotion^ 

into bacterial hosts mplasmid vectors. 
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Cre- The Cre recombinase is a site-specific recombinase. It recognizes and 
binds to specific sites called taP. Two loxP sites recombine at nearly 100% efficiency 
in the presence of Cre, thus, permitting DNA cloned between two such sites to be 
efficiently removed by the Cre-mediated recombination. 
5 De-repressible Promoter: When a repressor is bound to a de-repressible 

promoter transcription is substantially decreased as compared to transcription from the 
de-repressiblepromoterintheabsenceofrepresso, By regnlating the binding of the 
repressor, such as by changing the environment, the repressor is released from the de- 
repressible promoter, and transcription increases. In this definition , a de-repressmle 
10 promoterdoesnotr^reanactivatorfortranscription. One specific, non-bmitmg 
example is the PL promoter, which is regulated by the repressor cl, but is not activated 
by an activator. (The arabinose promoter is not a simple de-repressible promoter as 
arabinose inactivates the repressor AraC and converts it to an activator ). 

In one embodiment, the de-repressible promoter is a temperature sensitive de- 
15 repressive promote, For example, by increasing the temperature, the repressor is 
released from the promoter, or can no longer bind to the promoter with a high affimty, 
and transcription is increased from the promote, One specific, non-limiting example is 
the induction of PL promoter activity by increasing the temperature of the cell. 
Increased temperature inactivates the temperature-sensitive repressor cl, allowing genes 

20 that are operably linked to^ 

skill in the art can readily identify a repressive promoter. 

In one embodiment, a de-repressible promoter is auto-regulated. One specific, 
non-limiting example of an auto-regulated de-repressible promoter is PL. If only one 
copy of a gene encoding cl is present, yet many copies of the PL promoter are present, 
25 expressionofclisupregulatedsuchmatfranscript^ 

promoters. 

Double-strand break repair recombination: A type of homologous 
recombination exemplified by the lambda recombination proteins Exo, Beta and Gam, 
and shared by numerous other recombinase systems. A double strand break is the . 
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initiation point for concerted action of recombination proteins. Typically, an 
exonuclease degrades processively from the 5' ends of these break sites, and ssDNA 
binding polypeptide binds to the remaining 3' single strand tail, protecting and prepanng 
the recessed DNA for homologous strand invasion (Szostak et al., Cell 33: 25-35, 1983, 
5 Little, 1. Biol. Chem. 242: 679-686, 1967; Carter et al., J. Biol. Chen, 246: 2502-2512, 
1971- Lindahl et al., Science 286: 1897-1905, 1999). Examples of ssDNA binding 
polypeptides which bind to either ssDNA and/or dsDNA with 3 ■ overhangs and 
promote double-strand break repair recombination include lambda Beta, RecT of*. 
coli, Erf of phage P 22, and Rad52 in various eukaryotic cells including yeast and 
10 mammalian cells. 

Electrocompetent: cells capable of macromolecular uptake upon treatment 

with electroporation. 

Electroporation: a method of inducing or allowing a cell to take Bp 
m acmmoleou>es by applying elechic fields to reversibly petmeabUiae «"» 
U Various methods and apparatuses used are further defined and described m: U.S. Pat. 
No 4 695,547; US. Pa.. No. 4,764,473; U.S. Pat No. 4,882,28; and U.S. Pat. Nos. 
4,946',793; 4,906,576; 4,923,814; and 4,849,089, all of which are herein incorporated by 
reference. 

Eukaryotic cell: a cell having an organized nucleus bounded by a nuclear 
20 membrane. These include lower organisms such as yeasts, slime molds, and the like, as 
well as cells from multicellular organisms such as invertebrates, vertebrates, and 
mammals. They include a variety of tissue types, such as: endothelial cell, smooth 
muscle cell, epithelial cell, hepatocyte, cells of neural crest origin, tumor cell, 
hematopoetic cell, immunologic cell, T cell, B cell, monocyte, macrophage, dendntic 
25 cell, fibroblast, keratinocyte, neuronal cell, glial cell, adipocyte, myoblast, myocyte, 
chondroblast, chondrocyte, osteoblast, osteocyte, osteoclast, secretory cell, endocrine 
cell, oocyte, and spermatocyte. These cell types are described in standard histology 
texts such as McCormack, Introduction to Histology, (c) 1984 by J.P. Lippincott Co, 
Wheater et al, eds., Functional Histology, 2nd Ed., (o) 1987 by Churchill Livingstone; 
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Fawcet, e, a.., eds„ Bloom and *— = A ° f " W ^ * ^ 

and Wilkins, aU of which are incorporated by reference in tar entnety. 

EM . th e. x onuc.eas=of.ambda(and.henuc.eioacidenc„dingd.eexonuo 1 ea S o 

protein, involved in donble strand break reparr homologoos recombrnabon. See 
5 Example land references therein for discussion. 

Gam: a lambda protein (and nocleic acid encoding Gam) involved in donble 
stran d break repair homology recombination. ,. is beHeve ,o inhibit 
««* snob as mar encoded by me reoBCD and shcCsysrem of E. col, SeeExamp* 
, 7 »d 14 and references torn for drscnssion. Gam taction, when expressed m me 
10 JeU.is — .oxiCo.becen.andpr.ven.sgrowm. -^—^,0 
over its expression are always required. As described berem, PL and cl 857 are 
regulate Gam expression 

j . . AfPvn aT1 <i Gam* As discussed for Beta (see 
Functional fragments and valiants of Exo and nam. » 

"Functional fragments And Variant Of Bern"), it is recognized that genes enco mg 
,5 ExoorGammayneconsiderablymu^tedwiftontmalcnmiyaltermgdiennincuon, 

because of genedc code degeneracy, conserve amino acid substations, nonccitical 
deletions or insertions, etc. Unless me context makes otherwise dear, die, srmlambda 
Exo to, or lambda exonuclease are all intended to include me native .ambda 
exonuclease, and all fragment and variants of lambda exonucleuse. 

Extrachromosomak no. incorporate* into the chromosome or chromosomes of 
a cell In the context of nucleic acids, extrachiomosomal indicates an DNA 
oligonucleotide mat is no, covalentty incorporated into the chromosome or 
chromosomes of a cell I-trachromosomal refers to material such as an 
oligonucleotide tha, is incnrporated into die chromosome or chromosomes of a c* 
2S such as a DNA oligonucleotide covalently incorpora,ed mm the chromosomal DNA 

cell. 

Homo.ogou S arm, nucleotides at or near 5' or V end of a polynucleotide tvhich 
„ identical or similar in sequence to me tmget nucleic acid in a eel, and capable of 
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m ediabnghomologousrecomb m a ta w it h te «a Ig . t nuc.e i cac W .Ho m olo g ous™s 
I a,s„ Led » as homology m * one embodhnen,, a homo^ amr memoes - 
le as. 20 bases of a sequence homologous to a nucleic acid of interest fc anouaer 
— an, the homo.ogy area includes a, leas, 30 case pans of a seance 
5 logons* a nun,* acid of mterest. h, ye. another embodnt. 

shades a, leaa. 40 base pair, of a sequence bonbons «o anuclerc aa of t^h. 
a ftrtber embodhnen,, a homology ann incbrdea fiom about 50 .o about 100 baae per, 
of a sequence homolo g ous to a nucleic acid of interest. 

Homdogous recombinadon: an exchange of homologous polynucleotide 
10 segments anywhere along a lengto of two nucleic acid molecules. 

Host cent a cell mat is used in * techniques such as DNA cloning to recerve 
exogenous nudeic acid nro.ecu.ea. In one embodiment a bo, ce.. is used «o u— or 
aUow the reproduction of a vector, or to fccih.a.e the manipulation of nuoletc acrd 
Macules in vm-o. A host cell can be a proxaryodc or a eutayotic celb 
15 HVo-mediated gen. transfer: a memo4 of macromoiecular transfer rnto cells 

^ taactivated hemagglntmaung vims offapan and hpoaotnea, as deaenhed . 
Morishita e. ai., J. Chn. hrveat. 9, 2580-2585, .993; Monsmta e, a.., J. Chn. hrveat. 
94-978-984 1994; which are herein incorporated by reference. 

2Q ^^rea^bysomechangeinnreenvironmentofmeoeU. Examples o — 
pill abound innamre, and a broad range of envhonmen ^hormonal changes 

may activate or repress them. 

Mate* An "isolated" biological component (such as a nucleic acid or protein) 
te been substantia,!, separated or purified away fiom omer biological components tn 
25 the ce,, of the organism m which me component nalurally occurs U 

dnomosomalandexfia-cbromosoma.DNA^EHA.a.dpro^. Thusnuclcc 

,» j tt, p t^rm also embraces nucleic acids and proteins 
by standard purification methods. The term also emorace 



WO 02/14495 



PCT/US01/25507 



19 



5 



10 



15 



prepared by recombinant expression in a bos. cell as well as ehemieafty synced 
nucleic acids. 

Linear p.asm.d vector: a DNA sequence (!) containing a bacterial plasnrid 
ongin of replication, (2) having a free 5- and 3' end, and (3) capable of cncularizing 
Jreplicating * a baoteriai plasmid by joining its free 5> aard 3' en s. Examp esof 
linei a plasnrid vectors include tbe line»=d pBlneserip. vee.or and Unearned pBR322 
vectors described herein. 

UpofeCon: Tbe process of ntacromolecular transfer into ceils using liposomes. 
See U S Patent No. 5,65 1 ,981 , which is herein incorporated by reference. 

Min..ambda: A derivative of lambda (X) wherein nroa.of.hc viral lytic genes, 
ind udmg«tose required for replication and lysis, are deleted. Anrini-lambdamauriams 
fte red Lions ^ Exo, artd Gam, for homoiogous recombination and — 

^ a t „ ,h ititctrrase (int). and excisionase (xis)) to 
the integration/excision functions (e.g. att, integrase (mi), n 

insert and excise its DNA from the chromosome. 

Nucleic acid: a deoxyribonucleotide or ribonucleotide polymer in either single 
or double stranded form, including known analogs of natural nucleotides unless 
otherwise indicated. 

OUgo.ucleo.idet a aingle-shanded nucleic acid ranging in .ength from 2 to 
about 500 baaea, for example, polymrcleoudea mat contain , ,eaa. 2 0 or 40 nucleus. 
, Oligonucleotides are often syatmeric bu, can also be produced ftom nabarauy occumng 
polynucleotides. 

Operably .inked: A firs, nucleic aoid seqnenoe is operably nnked with a 
.econd nno.eio acid sequence when me firs, nueleio acid sequence is placeri ,n a 
Phonal relauonship wim me second nucleic acid sequence. For tnatance a promoter 
5 is operably Wed to a codmg sequence if me promoter aff«e.s me fransenphon or 
explsion of me coding sequence. <3enera.lv, oper*.y linked DNA seqneneea are 

rt where necessa* .0 jom W o prodding radons, m me same readm 

fiame. 
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Phagemid artificial chromosome: Also referred to as PI artificial 

A tvrce of artificial chromosome allowing for stable cloning of very large 
chromosome. A tvpe of artificial proc. Natl. Acad. Sci. USA, 92: 

DNA fragments. Further described m Shepherd, et ai., rr 
2629 1994- Iannou et al., Nature Genetics, 6: 84-89, 1994. 
, ' Phage-based recombination systems: Bacteria such as E. call encode their 

5 Phage oas ,,v,; r h are used in repair of DNA damage and 

own homologous recombination systems, which are used in rep 

own nomoiogo bactena often 

tn m aintain a functional chromosome. The viruses or pnag 

to maintain a tun fimctions Phage carries the Red recombination system, 

carry their own recombination functions, r nage a. 

i thphacterial recombination functions or 
' These phage systems can work with the bactenair 

10 independently of them. 

rl promoter: ft. major .=lW promoter of bacteriophage lambd* Once 
fce lambdaDNA is hrcotporated nr.o * bacteria, ch_c 
promoter is —by repressedby .be cl repressor. Upon —on of me ol 
. ^.fo^.obyb^aboo.ofaf^^ve^— 
„ ^tePLpromo.orisaorivafeifoadfng.o^aionof.ambdagenes. SoeP.g.1, 

Sanbrook ot al. Cb. 2, Stiyor, and Court and Oppenbcm. 

Plasmid: Autonomously repbcating, extrachromosomal DNA molecules, 
aUrin^mu.ononnalbao.erialgenomeandnonessenria.fotbactenalceUsu.wa, 

under nonselective conditions. 

any long* Tbetefore, a po.ynnCeoUde hrcludes molocuUs wh.ch ate 1, 5<U^ 
leobt .ong (oligonucleotides, and a*o nuoleoridea as long as a fuU length cPNA. 

P olype P «de: any cbain of amino acids, regardless of length or post- 
national modification (e.g, glycosylarion or phosphorylauon). 

■ i,^<.atnembrane-bound,sttucturattydtscrete 
25 p ro kaiyote: Cell or organism lackmg a memoran 

nucleus and other subcellular compartments. 

p r „„es and primers: A nucleic aoid pro* comprises an isolated nucleic acd 
^lade.Jie^or^mo^.Typioa.laho.mofudemdroacbve 
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1S oto P es, ligands, chemilummescent agents, and enzyn.es. Methods for labeling and 
guidance in the choice of labels appropriate for vanous purposes are discussed, , g ., 
Sambrook et al, (1989) and Ausubel et al, (1997). 

•j cforaVilv nNA olisonucleotides 15 nucleotides 
Primers are short nucleic acids, preferably DNA ongonu 

« more in lengfir Primers may be — «o a commentary target DNA sW by 
andeic acid hybndization ,o form a hybrid between the primer an me targ « DNA 

^d.benngi.henseofaDNApo^etaseenzyn.e. Primer pans (one nne« 
offll e target nno.eie acid se,nenee, canbe nsed for ampliation of a nucleic actd 
sequence, by me polymerase chain reaction (PGR) or ofiier nuc.etc-actd 
amphfication methods known in the art. 

Methods forpreparhigandusingprobesandprimers are described, for example, 
in Sambrook - al, (1989), Ausubel et al, (1987). PGR primer pairs can be denved 

for example by using computer programs mtended for that 
from a known sequence, tor example, oy ui> 5 

5 purpose such as Primer (Version 0.5, © 1991, Whitehead Institute for Biomedical 
Research, Cambridge, MA). Under appropriate conditions, me specificity of a 
Alomar probe or primer increases with its length. Thus, in order * obtam *ea« 
apecffiouy, probes and primes may be selected ft* comprise 
more oonseoutive nucleotides of related cDNA or gene sequence. 
, 0 Purified! The.ermpnrifieddoesno.iequireabsolmepurity;rather,i.ia 

' tended as a reform. Thus, for example, a purified ^BeUpr^o, 
ssDNA binding polypeptide is one in which Ae protein is more ■»«« * - 
protein ism its natural en— twithin a cell. Preferabiy.apreparahonoftabda 
Beta is purified such that die po.ypep.ide represent at leas, 50% of fire total protem 

25 content of the preparation. 

RecA: The RecA protein is a central protein that has an activity as in the 

reoombination function of* col, Homologues are found in al, other organtsms. RecA 
protein allows two homologous DNAs to find each omer among non-homologous 
MAS and then trade or transfer sfianda wifir each omer. This exchange occurs by 
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RecA binding to a single stranded region in one of the DN As and using that strand to 
search for its dsDNA homolog, binding to the dsDNA and causing the single strand to 
pair with its complement in the dsDNA ultimately displacing the identical strand of the 
duplex. This strand transfer generates a key intermediate in theRecA-mediated 

5 recombination process. 

recE recT genes and the Rac prophage: E. colt and other bacteria contam in 
their chromosomes remnants of viruses. These viruses or prophages are for the most 
part defective and may contain only a few genes of the original virus. In E. coh, one 
defective prophage is called Rac. Two genes, recE and recToi the Rac prophage, 

0 encode homologous recombination functions. These genes are normally silent but the 
sbcA mutation activates their constitutive expression. Thus, the sbcA mutant is active 
for recombination. 

Recombinases: proteins that, when included with an exogenous targeting 
polynucleotide, provide a measurable increase in the recombination frequency between 
1 5 two or more oligonucleotides that are at least partially homologous. 

Selection markers or selectable markers: nucleic acid sequences which upon 
intracellular expression are capable of conferring either a positive or negative selection 
marker or phenotypic characteristic for the cell expressing the sequence. The term 
"selection marker" or "selectable marker" includes both positive and negative selection 
20 markers. A "positive selection marker" is a nucleic acid sequence that allows the 

survival of cells containing the positive selection marker under growth conditions that 
kill or prevent growth of cells lacking the marker. An example of a positive selection 
marker is a nucleic acid sequence which promotes expression of the neomycin 
resistance gene. Cells not containing the neomycin resistance gene are selected agamst 
25 by application of G418, whereas cells expressing the neomycin resistance gene are not 
harmed by G418 (positive selection). A "negative selection marker" is a nucleic acid 
sequence that kills, prevents growth of or otherwise selects against cells contammg the 
negative selection marker, usually upon application of an appropriate exogenous agent. 
An example of a negative selection marker is a nucleic acid sequence which promotes 
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expression of the thymidine kinase gene of herpes simplex virus (HSV-TK). Cells 
expressing HSV-TK are selected against by application of ganciclovir (negative 
■ selection), whereas cells not expressing the gene are relatively unharmed by 

ganciclovir. The terms are further defined, and methods further explained, by U.S. 
5 Patent 5,464,764, which is herein incorporated by reference. 

Selectable phenotype: A cell with a selectable phenotype is one that expresses 
a positive or negative selection marker. 

Sequence identity: the similarity between two nucleic acid sequences, or two 
amino acid sequences is expressed in terms of the similarity between the sequences, 
10 otherwise referred to as sequence identity. Sequence identity is frequently measured m 
terms of percentage identity (or similarity or homology); the higher the percentage, the 
more similar are the two sequences. 

Methods of ahgnrnent of sequences for comparison are well-known in the 
art. Various programs and alignment algorithms are described in: Smith and Waterman, 
Adv Appl. Math. 2:482, 1981; Needleman and Wunsch, /. Mol. Bio. 48:443, 1970; 
Pearson and Lipman, Methods in Molec. Biology 24: 307-331, 1988; Higgins and Sharp, 
Gene 73:237-244, 1988; Higgins and Sharp, CABIOS 5:151-153, 1989; Corpet et al., 
Nucleic Acids Research 16:10881-90, 1988; Huang etal., Computer Applications in 
Biosciences 8:155-65,1992; and Pearson et al., Methods in Molecular Biology 24:307- 
31,1994. Altschul et al. (1994) presents a detailed consideration of sequence alignment 
methods and homology calculations. 

The NCBI Basic Local Ahgnrnent Search Tool (BLAST) (Altschul et al., J. 
Mol. Biol. 215:403-410, 1990) is available from several sources, including the National 
Center for Biological Information (NBCI, Bethesda, MD) and on the Internet, for use in 
25 connection with the sequence analysis programs blastp, blastn, blastx, tblastn and 

tblastx. It can be accessed at the NCBI website, together with a description of how to 
determine sequence identity using this program. 
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Homology of lambda Beta, Exo and Gam, and ssDNA binding protein, typically 
possess a, teas, 60% sequence idenuty counted over (nil-length aliment with the ammo 
acid sequence of to protein being evatuated (to, is, lambda Beta, Exo or On or 
siding protein seen as P22 Brf, RecT, and Rad52) usmg the NCB! Blast 2.0 
gapped bias* set to defhufi p— . For oonrpansons of amino aord fences of 

™ ■ the Blast 2 sequences function is employed using the 

greater than about 30 amino acids, the Blast l sequ 

default BLOSXJM62 manix se, to default parameters, (gap existence cos. of 11 and a per 

,■ ■ t™i,, m Hdes (fewer than around 30 amino acids), 
residue gap cost of 1). When aligning short pepbdes (tewer m 

ta ahgLent should be performed using me Blast 2 sequences fnoc on, employtng^ 
PAM30 matrix se. to default parameters (open gap 9, extension gap . penalues). Protetns 
with even grea.r similarity .o me reference sequence wil. show increaamg percent 

„ ,eas. 90S. a. least 95%, a. ,ea, 98%, or a. lea, 99% sequence tdemrty. When 
ton me entfie sequence is behtg compared for sequence iden«i.y, homologs ***** 
possess a. lea, 75% sequence idem* over short windows of !0-20 annuo acds^may 
possess sequence identities of a. leas. S5% or a. ,eas. 90% or 95% depending « men 
Llarity .o me reference sequence. Methods for detemfinfitg sequence idenhly over 
sui short windows are described at me NCBI website 

One of skill in the art will appreciate to. fltese sequence identity ranges are 
provided for guidance only; it is entirely possible mat strong* significant homologs or 
other variants could be obtained to. fall outside of fire ntnges provtded. 

Single-handed DNA (ssDNA) and doubfit-stranded DNA (dsDNA)t 
ssDNA is DNA fit a single polynucleotide chain; to DNA bases are no. involved m 
^ri^panfi.gwn.^merpo.ynno^neofiam. ™*£Z' 
, . more complementary polynuCeoride obains, in which to two polynucleofide >« 
a, M paruaily Walson-CriOc base^ afied to each otor. Note to. a segment of DNA 
may be partiany ssDNA and partiafiy dsDNA, for example if there are gaps m one 
po^nJeoddechainof asegntentofdsDNA.orto.areS. 0.3' overhangs. ssDNA 
. m d dsDNA may contain nucleotide analogs, nonnatnrally occming or syntofio 
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nucleotides, biotin, or epitope or fluroescent tags. ssDNA or dsDNA may be labeled; 
typical labels include radioactive isotopes, ligands, chemiluminescent agents, and 
enzymes. 

Site-specific recombinase: a recombinase wbose activity is limited to DNA of a 
5 specific sequence. Examples include the Cre, FLP and FLPE recombinases. Lambda 
Int is specific for the art sites. Cre recombinases are site-specific for loxp recombination 
sites in a DNA sequence, whereas FLP and FLPE recombinases are site-specific for 
FRT recombination sites. A recombination site is a nucleic acid sequence specifically 
recognized by a recombinase. For example, the Cre recombinase specifically binds a 
1 0 loxp recombination site, and thereby induces recombination. 

Target nucleic acid sequence: the nucleic acid segment which is targeted for 
homologous recombination. Typically, this is a segment of chromosomal or 
extrachromosomal DNA in a cell. Extrachromosomal DNA harboring target nucleic 
acid sequences may include episomal DNA, plasmid DNA, bacterial artificial 
15 chromosome, phagemid artificial chromosomes, yeast artificial chromosomes, cosmids, 
and the like. The target nucleic acid sequence usually harbors a gene or gene fragment 
which will be mutated in some fashion upon homologous recombination. Examples of 
target nucleic acid sequences include DNA sequences surrounding the tyr 145 UAG 
amber mutation of galK, as described in Yu et al., PNAS 97: 5798-5983, 2000, and in 
20 Example 3 of this application; the second exon of mouse hox 1.1 gene, as descnbed m 
U.S. Patent No. 5,464,764; the human hemoglobin S gene mutation as described in 
Example 15 of this application. 

Targeting frequency: The frequency with which a target nucleic acid sequence 
undergoes homologous recombination. For example, extrachromosomal DNA is 
25 introduced into a eukaryotic cell. The extrachromosomal DNA has sequences capable 
of undergoing homologous recombination with a target intrachromosomal DNA 
sequence. After introducing the extrachromosomal DNA and allowing homologous 
recombination to proceed, the total number of cells may be determined, and the number 
of cells having the target DNA sequence altered by homologous recombination may b e 
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determined. The targeting frequency is the number of cells having the target DNA 
sequence altered, divided by the total number of cells. For example, if there are a total 
number of one million cells, and 1,000 of these cells have the target DNA sequence 
altered, then the targeting frequency is 1 in 1,000, or 10 . 

Transformed: As used herein, the term transformation encompasses all 
techniques by which a nucleic acid molecule might be introduced into such a cell, 
including transfection with viral vectors, transformation with plasmid vectors, and 
introduction of DNA (including DNA linked to Beta protein) by electroporation, 
lipofection, and biolistics. 

Upstream: refers to nucleic acid sequences that preceed the codons that are 
transcribed into aRNA of interest. Similary, downstream refers to nucleic acid 
sequences that follow codons that are transcribed into aRNA of interest. 

Variants of Amino Acid and Nucleic Acid Sequences: The production of 
lambda Beta, Exo or Gam, or other ssDNA binding polypeptide can be accomplished in 
15 avarietyofways. DNA sequences which encode for the protein, or a fragment of the 
protein, can be engineered such that they allow the protein to be expressed in eukaryotic 
cells, bacteria, insects, and/or plants. In order to accomplish this expression, the DNA 
sequence can be altered and operably linked to other regulatory sequences. The final 
product, which contains the regulatory sequences and the nucleic acid encoding the 
20 therapeutic protein, is operably linked into a vector, allowing stable maintenance in a 
cell. This vector can then be introduced into the eukaryotic cells, bacteria, insect, 
and/or plant. Once inside the cell, the vector allows the protein to be produced. 

One of ordinary skill in the art will appreciate that the DNA can be altered in 
numerous ways without affecting the biological activity of the encoded protein. For 
25 example, PCR may be used to produce variations in the DNA sequence which encodes 
lambda Beta, Exo or Gam, or other ssDNA binding proteins. Such variants may be 
variants that are optimized for codon preference in a host cell that is to be used to 
express the protein, or other sequence changes that facilitate expression. 
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Two types ofcDNA sequence variant may be produced, In the firs, type, fire 
variation in the cDNA sequence is no. manifested aa a change to «- amino aord 
sequence of fire encoded polypeptide. These silent variations am ahnply a reflection of 
,he degeneracy of the genetic code, fa the second typo, .he cDNA sequence ^nation 
doesresnl.inachangemuaeammoacidseqn^eofu.eencodedpro.em. Insuch 

cases, .he variant cDNA sequence produces a variant polypeptide sequence In order to 
preserve fire Actional asrd immunologic identity of me encoded polypeptide, auch 
^tao acid substitutions arc ideally conservative in highly conserved regmna. 
Conservative substitutions replace one ammo acid with another amino ac,d that . 
stofiar in siae, hydrophobic,*, etc. Outside of highly conserved regrona, non- 
conservative substitutions can mom readily be made without affecting tacon of fire 
protein. Examples of conservative subnotions are shown in Table 1 below. 
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Table 1 



Original Residue 


Conservative Substitution 


Ala . 


Ser 


Arg 


T 


Asn 


Gin, His 


Asp 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Pro 


His 


Asn; Gin 


T1 A 

lie 


Leu, Val 


Leu 


T1p-Val 


Lys 


Arg; Gin; Glu 


Met 


Leu; He 


Phe 


Met; Leu; Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Tip; Phe 


Val 


Die; Leu 



25 



Variations fa the cDNA sequence that resnlt in amino acid changes, whether 
conservative or not, should he minimized in order to preserve Ore Smchona. and 

^nnno^c identity of the encode, protein. The ^ F»« 
^yheaasessedhydet.rnunfagwhefaernisrecogniz^hyananubodytothepro.ern, 

a variant that is recognized hy snch an antibody is faununologically conserved. 
Particular examp.es of cDNA sequence variants introduce no more rtaan 20, and fewer 
to, 10 amino acid substitutions, into the encoded polypeptide. Variant ammo acrd 
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sequences may, for example, be at least 80, 90 or even 95% identical to the native 
amino acid sequence. 

Yeast artificial chromosome (YAC): A vector used to clone DNA fragments 
(up to 400 kb); it is constructed from the telomeric, centromeric, and replication origin 
sequences needed for replication in yeast cells (see Ausubel et al.). 



Use of the Lambda-encoded Red recombination system 
in recombineering mediated by a defective prophage 
Bacteriophage X contains a homologous recombination system termed Red, 
which is functionally analogous to the RecET recombination system of Rac. Like 
RecET Red recombination requires two genes: redo, or exo, which is analogous to recE, 
and redp (or bet), which is analogous to reel). Exo is a 5'-3' exonuclease that acts 
processively on linear dsDNA. Beta binds to the ssDNA overhangs created by Exo and 
15 stimulates annealing to a complementary strand but cannot promote direct strand 

invasion and exchange on its own. The recombination functions of Exo and Beta are 
again assisted by X phage-encoded Gam, which inhibits the RecBCD activity of the host 
cell 47 . X Red-mediated recombination events are 10 to 1000 times more efficient than 
those observed in recBC sbcBC or recD strains. Because homologous recombination is 
increased dramatically by the addition to the host of phage-encoded protein functions, 
this procedure is widely applicable to any E. coli strain and to other bacterial species as 
well. 

A defective X prophage-based system is disclosed herein for Red-mediated 
recombineering (see Fig. 2) . In this system, Gam, Beta, and Exo are encoded by a 
defective lambda prophage, which is integrated into the E. coli chromosome of a 
bacterial cell (e.g. E. coli) (see Fig. 6 and Fig. 9). Expression of Gam, Beta, and Exo is 
under the tight control of a de-repressible promoter. In the example shown, the de- 
repressible PL promoter is under the control of the temperature-sensitive X cI857 
repressor. At 32°C, when the repressor is active, expression of the PL promoter and 
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fcese genes is undetectable. However, when the cefts ate shifted ,o 42°C for a brief 15 
min period, the repressor is inactivated and the genes are expressed a. very high levels. 
In contrast, promoters that can be activated, which are present on plasmrds, are 
notoriously difficult to control and Red and Gam functions would be expressed even > 
S me absence of the inducer, such as acinose. I^w-level expression of Gam causes a 
RecBCD defect, a condition Utat reaults in plaamid instability and toss of cell viabritty. 

The tight regulation afforded by the prophage system, combined with the fact 
tat ta 1 ptomoter, which drives Gam and Red expression is a very strong promoter 
makes it possible to achieve recombination fluencies ta, axe a. leaat 50- -fold htgher 
„ than those found with ta piasmid-based system used previous* (see Muyrers et al, 
Nucleic Acids Re, 27: 1555-1557, 1999; Yu, D. e. al. Proc Natl Acad Sci US AW: 
5978-5983 2000). and several orders of magnitude higher tan previously descnbed 
■ stratamwhichlinearrecombinationhasbeensmdied.Theprophaget.self.s 

generally stable, unlike plasmids, and does not rely on ta presence of drug selechon 

15 for maintenance. 

FIG 2 illustrates the design of primers for amplification of a dsDNA 
combination cassette, and a strategy for generating recombinant DNA molecules and 
gene replacement. THe steps and advantages of this system are outlined below. 

Classical recombinant DNA technology or genetic engineering has primarily 
20 relied upon the presence of restriction enzyme cleavage sites to judiciously cleave DNA 
and the use of DNA ligase to covalently join different DNAs to make the recombmants 
wanted. Tho ability to do genetic engineering has been simplified by the polymerase 
chain reaction (PGR), which allows restriction sites to be incorporated into linear PGR 
products thereby allowing more precise positioning of those sites. All geneuc 

25 engmeermgtechnology*^ „ 
contam hundreds of kilobases of DNA. Examples include the bacterial chromosome and 
large genomic BAG clones. Even rare restriction enzyme sites. occur frequently on such 
large DNA molecules making the effort to use unique sites impossible. Furthermore, 
the in vitro manipulation of linear DNAs of this length is also extremely difficult. 
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Therefore, once large BAG cloning technology became available in E coll, 
Zm* on of the BAC clones became te primary problem. Urinaly a —on 
^generic engtneering rechnoiogy and Cassica, homologons recor^inanon — 
Z adapted to modify fine large genomic Cones. Classica! homologons —.ton 
m E coli depends npon sipnfcant (>500 bp) stretches of homCogy between DNAs. 

' Fig 2A depieta a fvpicaf genetic engineering protean! ,0 nrodify a target on a 
BAC Clone with a cassette and compares *». technology wrfia the recent* developed 
Inbineering teehno.og, that naea specia, phage recombinaoon fi— s. hr gene C, 
ITare nr^y steps reared for classica. engineering, and the final prodnc. cannot be 
mere arc m j ^j-j™ -fte critical advance 

engineered as precisely as by the new reeonrbmeenng technology 

aerate meonrbmation prodocts ushrg homologies of 50 bp (or less). Note drat fce 
Lge, homologies in Fig. 2A arrd Fig. 2B are represented by the smped boxes, hr *e 
Jhod onfiined in Fig. 2A, those boxes most be a. >eas, 500 bp long whereas . me 
met hod ontlmed in Fig. 2B, Urey need only be abon. 40 to abont 50 bp long. 
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Genetic engineering steps to generate BAC recombinant mclude: 

- CleavageofthecassetteDNAbyarestrictionenzyme. 

- Cleavage of target on plasmid by a restriction enzyme. 
(Notein this case the vector has been pre-engineered to contam target 

fragments.) 

■ Joining the cassette to the plasmid by DNAligase. 

- Transforming the DNA into cells. 

■ Selection of drug R clones and isolation of the plasmid. 
25 ■ Verification of the cloned cassette. 

. Transformation of the cloned cassette into the BAC strain. 

■ , „<. ^ > > tn introduce the cloned cassette into tne 

- Several recombination steps -> -> -> to introduce 

BAC. 
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Recombineering steps to generate BAC recombinants include: 
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- Generation of two primers (white and black arrows). 

■ Generation of PGR amplified cassette with flanking homologies (In this panel, 
exemplary striped homology segments shown are 50 bp long). 

■ induction of phage recombination functions into BAG strain or introduce of 
BAC DNA into strain carrying recombination functions. . 

■ Transformation of cells containing BAC and recombination functions with PGR 

cassette. 

■ Generation of recombinant in vivo, 
v Detection of recombinant: 

o Detection by selection or counterselection. 
o Detection by direct screening (colony hybridization), 
o Detection by deteoling a label on the nucleic acid (e.g. when DNA 
includes a DNA adduct or a marker such as biotin) 
Aa disclosed herein, the defective X prophage has dsn been transferred to tire 
BAC hos« strata DH10B so ma, ft nan be used for BAC engineering. The modrfied 
DH10B strain called DY380 nan be transformed with BAC DNA a, efncreno.es of 10 
m 10- The utility of DY380 cells for BAC engineering has been denoted by 
inhoduotag a 250 kbp mouse BAC that contains tire neuron.Upecific cotoe 2(Eno2) 
gene into DY380 cells by electxoporafion and tiren modifying the BAC by tatroducmg a 
Oppressing targeting cassette into tire 3' end of the Eno2 gene usmg Red 
recombination (see Example 20). The targeting cassette was PCR-amptified from a 
template plasmid using chimeric 63 nucleotide (n«) primers. The 3' 21 nt of each 
primer was bomoiogous to tire targeting cassette, while tire 5- 42 n« 
L las. exon of ^2 where tire cassette was «o be targeted (see Fig. 10). DY380 -lis 
were men e,ec«ropora«ed with the amplified targeting cassette and correctly targeted 
monies were obtained a, an efficiency approaching 10* following tire induction of Red 
expression; no targeted colonies were obtained in uninduced cells. 

A3 also disclosed herein, the modified full length BAC waa punfied and tnjectad 
in mouse zygotes and a BAC fransgemc line established. Two other transgenrc hues 
carrying a shorter 25 kbp subclone of the modffied Eno2 gene on pBR322 were also 
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established as controls. The 25 kbp subclone carries the entire modified Eno2 coding 
region as well as 10 kbp of 5' flanking sequence and 5 kbp of 3' flanking sequence. 
Tbe activity of the cre gene in the different transgenic lines was then assessed by 
crossing the mice to ROSA26 reporter mice. These mice carry a lacZ reporter that can 
be activated by Cre recombinase. In mice carrying the full length BAG transgene, Cre 
activity was detected in all ^-positive neurons. In contrast, not all W-positrve 
neurons expressed Cre in the transgenic mice carrying the smaller 25 kbp subclone, and 
the pattern of Cre expression varied between the two different 25 kbp subclone hnes. 
These results are consistent with previous studies showing that regulatory sequences 
, can be located hundreds of kilobases from a gene, and highlight the usefulness of BAG 
engineering for in this case generating Cre-expressing lines for use in conditional 

knockout experiments. 

Arabinose-inducible flpe or cre genes have also been introduced into the 
defective prophage carried in strain DY380. flpe is a genetically engineered^ that ha, 

5 a higher recombination frequency than the original^ (Buchholz et al., Nat Biotechnol 
16- 657-662, 1998, herein incorporated by reference). The site-specific recombmases 
Flpe and Cre are important tools used to add or delete DNA segments (e.g. drug 
cassettes). Flpe and Cre expression can be induced by the addition of arabinose and 
used to remove the selection marker from the targeted locus. This will be especially 

, 0 important in cases where the selection marker interferes with the expression of the 
targeted locus. It is important to note that even excision of the selectable marker by 
Flpe or Cre recombination leaves behind the fit or loxP site as a scar on the targeted 

locus. 

25 BAC modification without leaving markers or Scars ' at the target site and 

direct genomic modification 
A two-step procedure for BAC targeting has been developed wherein many 
kinds of mutations can be introduced into BACs without leaving behind a selectable 
marker, such as a drug selection marker, at the targeted locus. In one embodiment, a 
30 two-step procedure is utilized. This is exemplified in the following specific, non- 
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taiung sxan.pl, a PCR-generated targeting cassette * "** ^ n 

"1 .argid - a BAC or other DNA. Ce«s cental dre «*« cassette 
led to *e genomic DNA of*. BAC we then reformed wrdr a second 
targeting DNA to the same region. TMs cassette was desrgned to teptace the «*~ 

11 ^« Rvnlacine these newly transfonned ceus on 
«ihtle mutation, such as a small insertion. By piacmg m 

subtle muxauui , expression, which 

m edia with 7o/o sucrose, selective pressure was applied against J 

inverts sucrose to a bacteriotoxin (Muyrers et al., £M50 Rep 1: 239-243. 2000). 
converts sucrose d ^ Jflc5 _ 

Growth on sucrose plates thus selected for cells that n p 

* oPTim*. tareeting cassette containing the small insertion. 
neo targeting cassette with the second targetog ^ ^ 

Because spontaneous mutations occur in sacB to cause 

1 in 10 4 recombinants were identified among sucrose resistant colonies 

rs2i-— ----- rtr 

5 kinds ofgeneUc changes besides insertrons can also be generated, m g 

.——-«-«-— ^2 

DNLo.ecnl.snnhas.BAC.PAC.or^E.conctaornosont.^ontnn, 

accompanied marker. thedefectiveprop hage 

tte high « ^ ^ gmome or . BAC 

W (see Example 2V, The recombinants were found by screenmg rndrvrdua! cells 

noting oasser.es can also be ntade b y simpl y anneal two complement^etrc 
^DNA oligonucleotides together. As described berebr, a 70 bp targehng 

30 mutations a. rrerptenciea approaching one in a thousand electroporated cells. 
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rotations can thus be introduced 
stations corresponding to human disease-causing unions can 

Wo any human or mamalian gene carried on a BAG with ease and the affect of this 
mto any human h in a transeenic that carries a null mutation in the 

mutation on gene function assayed in a transgenic 

corresponding mouse gene. 

Cloning DM by gap repair 

. Hfi-nm-BACsbv Red-mediated recombination without 

Fragments can be subloned from BACs by K 

nxr A 1i eases Thus, any region of the B AO is 
the use of restriction enzymes or DNAhgases. , ' ^ 

L* ,o — g , - — , doea nordepend ontoe 

^d— — «>- — Subcloinng relies on gap reparr 

£ elds of a ..near plasnfrd vector with homologous sequences earned .ft. 

free ends ol a lino p -rha linear olasrnid vector vri*, for example, 

M example is shown in Figure S and ^ end , 

.^selectablemar.erandanongi^ 

The vector is generated, for example, by PCR amphn 

The 5' end of each primer has homology to the extremities of the BAC sequen 

The 5 enaore y a H +rt nrime and amplify the linear plasmid 

^i^-theS'endofeachprimensusedtopnmeanaduip j 

subcloned, the 5 enaoi p retrieved 

resistance (e.g. Amp ) pnenoxypc. ^ wRW^mot 

*.t.«i with a hieh copy vector such as pBluescnpi, 

fragments up to about 25 kbp ate subclon 

. P BR322 is used, fragments as l,ge a^out* ^ (- 
fragments were shown to be more accurately expressed m 
was the entire BAC clone see above). 
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' a mobilizable lambda prophage 
As diacW herein, —anon funcuons were expressed from fhefr nafrve 

- t "irito:— on-ptofrcienrOV,. 
30 6 and 9 is that BACs under study must be moved mto 
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cells before the BAG can be manipulated. In order to overcome this limitation, a novel 
prophage derivative has been generated that is isolated as a mini-lambda circle DNA 
carrying a selectable marker (e.g. a drug-resistance marker such as tet cassette) and 
containing the exo, bet, and gam genes under control of the temperature inducrble cI857 
repressor (Fig. 12). This mini-lambda can be transformed into any bacterial cell, such 
asaDHlOBcellthatcarryaBAC. The mini-lambda then integrates at the lambda 
attachment site to generate the defective prophage. This mobilizable prophage makes „ 
possible to introduce the prophage into BAC-containing DH10B libraries and obvrates 
the need to transfer the BAC to DY380 cells. 

Recombineering using ssDNA 
Recombineering, or the use of a recombinase to mediate recombination using 
homology arms sufficient to induce recombination, as disclosed herein, can be 
performed using single-stranded oligos as the targeting cassette). As descnbed m the 
Examples below (e.g., see Example 3), in E. coli, a single base change has been 
substituted in the galK gene and a 3 .3 kbp insertion removed from the galK gene usmg 
single-stranded oligos. Single-stranded oligos have also been used to cure 5 afferent 
TnlO insertions at different places on the E. coli chromosome. Recombineenng usmg 
single-stranded oligos is very efficient with up to 6% of the electroporated cells bemg 
recombinant. Whereas Exo, Beta, and Gam facilitate recombination of PGR amplified 
dsDNA cassettes with flanking homologies, only Beta is required for ssDNA 
recombination (see Fig. 13). Maximum recombination is achieved with 
oligonucleotides of about 70 bases in length, although oligonucleotides of about forty to 
sixty bases in length can also be used to achieve recombination, albeit at a 5-fold lower 
frequency. In one embodiment, ssDNA of about 40 to about 70 nucleotides in length rs 
utilized. In another embodiment, ssDNA of about 70 to about 100 nucleotides in length 
is utilized. In a further embodiment, a ssDNA of about 70 to about 1,000 nucleotides « 
len gth is utilized. Interestingly, Beta-mediated recombination activity is less effrcrent 
when ssDNA molecules are about 1,000 bases in length. In yet another embodunent, 
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the ssDNA is labeled, such as with a biotinylated nucleotide, a methylated nucleotide, 
or a DNA adduct. 

Recombination with either of two complementary DNA otigos has revealed mat 
ahhough either strand can be efficiently used for recombination, one strand . more 
competent for recombination than the other. This strand bias has been examined 
several positions around the bacteria, chromosome with me result that to preferred 
strand correlates with the lagging strand of DNA replication for each site tested. 
Without being bound by theory, these results indioated drat strand bias is assoctated 
with mo replioation direction through tho region being targeted and mat ssDNA 
, combination oenrs efficiently near the replication fork. The process of DNA 
rephoation results in transient regions ofssDNA that may be accessible to Beta- 
med iated annealing of tire ssDNA oligo. Although recombination occurs on me leadurg 
stmnd the increased recombination efficiency of the lagging strand ohgos may reflect 
toe increased ftequency of single-stianded regions during lagging versus leading strartd 
5 synthesis (Fig. 13). DNA polymerase and DNA ligaae could men complete the jotntng 
of the annealed oligo to the lagging strand. Without being bound by theory, Are 
inched frequency of ssDNA recombmauon probably reflects the fact that ssDNA 
^bmationocouraton^asimptemeetoniamthandsDNAreconJbmation. The 

ssDNA recombination may require only annealing of on. single-s«rand«l ohgo to 
20 single-stranded regions in the replicating target DNA. Moreover, ssDNA 

tK ombination also occurs in yeas, wtflr a atiand biaa mat may also be dependent upon 
Miction. The yeas. Amotions required for mis recombination are, however, unknown 
maki ng the finding mat only the Beta function from phage Us required » E. cok that 

much more significant. . 

25 fc one specific, non-limiting example, a point mutation was engmeered mto the 

mouse Brca2 carried on a B AC using a 70 nt oligo. The targeting efficiency was 
several times higher titan would be found wift dsDNA created by annealing ohgos and 
a, least 50 times higher man with dsDNA generated by PGR and containing large 
^ons of nonhomologyintheir center. A 140 mononucleotide has also been us^o 

30 introduce a 29 amino acid in-frame deletion into exonl 1 of tire f>cn2 gone and a 1 .93 
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k b deletion into to BAC vector backbone (Swaminathan, S. et al. (M. 29 14-2!, 
2001 , herein incorporated by reference). Finally, a lo4 n, oligo has been use to 

n. oligo (7 7 X 10") was nearly to same es .be targeting efficiency for generating 
deletions using 140 n. oligos (8.3 XlC and 5.4 X 10». respectively). 

EXAMPLES 
EXAMPLE 1 

Modified Lambda Prophage for Deflned Expression of Recombination Proteins 

The molecular genetics of lambda bacteriophage, inolndmg its lytic and 
isogenic growm cycles, is described in Sambrook e, a,., Bacterioph ageUmWa 
Vectors Chapter 2 in Molecular Cloning: a Laboratory Manual, 2nd Ed, (c) 19S9 
linger Sambrooa < al., Ch. 2); Stryet, Conho. of Gene Expression* Procaryotes, 

Chapt er 32 in Biochemistry 3rd Ed., pp. 799-823, (c) 1988 <^*££ 

0^1 977 in Hendrix et al. eds., Lambda IX Cold Spring 
S Court and Oppenhenn, pp. 251-277 in uenanx 

H lrUbtss,(c)19830r— ConrrandOppenbeim). The comp.ere s^ce 

oftabda is known (see Genbank Accession No. NC 001416, herem mcorporated by 
reference.) 

Phage lambda has a well-characterized homologous recombination system. 
20 Wublesti^dbre^mDNAammeimtiationsrtesformiarecomhmationCThaleret 

a, J Mo.. Bio.. .95: 75-87, .987). U** oxonuclease (Exo) degradea processrve.y 
me 5' ends of tose break sites, and .ambda BeUbinda to me remaining ; 3 smgie 

• a tiip recessed DNA for homologous strand invasion 
strand tail, protecting and preparing the recessed DJNA 9 ^.«o2 2512 

(Little, J. Biol. Chen, 242: 679-686, 1967; Carter et al., J. Biol. Chen, 246. 2502-2512, 

25 1971). 

The lambda recombination system containing exo and be, without gnm ts 
efficient a, gene replacement usurg hnear substrates with homology arms of more ton 
, 000 bp in a sham lackingRecBCD nuclease (Murphy, Journal of Bactenology 180. 
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2063-2071 1998). To test homology arms of less than 100 bp long as substrates for 
lambda-mediated recombination, a lambda prophage was modified to express high 
levels of phage recombination functions for a defined amount of time. 

FIG. 1 depicts the defective X prophage on the E. coli chromosome. The 
defective prophage contains X genes from cl to int. The PL operon is intact and 
expressed under control of the temperature-sensitive lambda cl-repressor (allele cI857). 
A deletion (dotted line) removes the right side of the prophage from era through attR 
and including bioA (Patterson et al., Gene 132: 83-87, 1993). On the chromosome, the 
nadA and gal operons are to the left of the prophage, and the bio genes without bioA are 
to the right. Genes of the X prophage are shown on the solid line, and genes of the host 
are shown on the broken line. PL and PR indicate the early left and right promoters of 
X attL and attR indicate the left and right attachment sites of X. The lambda genes and 
functions are described in Sambrook et al., chapter 2, Stryer, and Court and Oppenhenn. 

The absence of cro-repressor allows PL operon expression to be fully 
derepressed when the temperature sensitive cl-repressor is inactivated at 42°C. The cro 
to bioA deletion removes the replication and lytic genes of the prophage. The functus 
encoded by these lytic genes are toxic to the cell and cause cell death within 7 mm after 
a normal prophage induction. Functions present in the PL operon are also toxic but fall 
cells only after 60 min of continuous induction (Greer, Virology 66: 589-604, 1975; 
Kourilsky et al., Biochimie 56: 1517-1523, 1974). Thus, shifting of cells contaimng the 
PL operon construct from repressed conditions at 32°C to induced conditions at 42°C 
allows PL operon expression. Shifting the cells back from 42°C to 32° C (or lower) 
within 60 min reestablishes repression and prevents cell death. 

As the following examples will demonstrate, this modified lambda prophage has 
produced unexpected advantages in mediating homologous recombination. These 
unexpected advantages include surprisingly high recombination efficiency, prease 
control of recombination functions, effective recombination with short homology arms, 
and ability to generate homologous recombinants with polynucleotides other than long 
double-stranded DNA. Without wishing to be bound by a single explanation of the 
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observed effects, it is likely that these unexpected advantages accrue from incorporation 
of the lambda red genes on the prophage in their native context,thereby limiting the 
number of copies of the lambda recombination genes. In addition, use of the PL 
promoter confers the ability to precisely control the timing and production of large 
amounts of lambda recombination gene expression. 

This system is not limited to expression in E. coli, but can also work in other 
bacteria, such as Salmonella and others. It can also work in eukaryotic cells, such as 
yeast or mammaliancells, with selection of appropriate promoters, and with other 
modifications of the present to allow expression of the lambda recombinase genes. In 
one specific, non-hmiting example, in another bacteria, genes between gam and N 
(including JVbut not gam) can be deleted to remove transcription terminators. 

Although the PL promoter is used to illustrate the invention, other de- 
repressible, inducible or constitutive promoters could be used. Specific, non-limiting 
examples of inducible promoters are drug inducible promoters (e.g a tetracycline 
inducible promoter) metal inducible promoters (e.g. the metallathione promoter), or a 
hormone inducible promoter (e.g. a steroid responsive element). 

The PL promoter could also be used to drive expression of, for example, P22 
genes such as erf, or of RecE and RecT. 
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EXAMPLE 2 

Bacterial strains, Expression of PL operon, Electroporation Methods. 

Identification of Recombinants 

Bacterial strains used in this work are listed in Table 2. 



Table 2 



Strains 
WJW23 
ZH1141 
BR3677 
DY329 
DY330 
DY331 
DY378 
W3110 
HME5 
HME6 
HME9 
HME10 
HME31 
HME40 
HME43 
HME47 
DY41 1 
DH10B 



DY303 
DY374 
DY363 
DY380 
EL11 
EL250 
EL350 



Genotype 



W3110 A(argF-lac)Vl69 gal490 XNAacZ A(N-int) cI857 A(crc-bioA) 
lac? lacZ(M15) A(srl-recA)301::TnlO 

W31 10 A(argF-lac) U169 nadA::TnlO gal490 M857 A(cro-bioA) 
W31 10 A(argF-lac) U169 gaU90 Xcl857 A(cro-bioA) 
W31 10 A(argF-lac)U169 Asrl.recA)301::Tr>10 g al490 XcB57 A(cro-bioA) 

W3 1 10 Xcl857 A(cro-bioA) 
"Wild-type" 

A(argF-lac)U169 U1857 b(cro-bioA) 
UargF-lac)U169 galK^UAO M857 ^cro-bioA) 
MargF-lac)U169 galK^mo ™857 Mcro-bioA) tyrTVocat 
^/•^-^^ (^*^ MsRM 
M.argF-lac)U169galK><catsacB \cl857 Xcro-bioA) 
XargF-lacWMMgaUgal^K^AcW M857 6(cro-bioA) 
^argF-laOUmgalK^X f^<>«' A (gam-N) ci8576{cro-bioA) 
g aim34<>kan Xexo<>cat c\857Mcro-bioA) 
zaHC!134<>kan \c\857 A (cro-bioA) 

P mcrK A^rr.bsJmS-.ncrBC) .SOd/ocZAMlS Afc C X74 deoK recM ^1 -»» A(aro, 
/ew)7649 galU galK rspL 7\upG 
DH10B [Xc/857recA*] 

W31 10 gfl/490 nadAr.TnlO [W857 Kcro-bioA)] 
W3110 A/acU169 gaim [XclSSl {cro-bioA)otet a ] 
DH10B [Xc/857 {cro-bio A)otet] 
DH10B pic/857 (cro-bioK)<>caUsac8\ 
DH10B [Xc/857 {cro-bioKjOaraC-^^flp^ 
DH10B [Xd857 (cn^6ioA)<> flraC-PBADcre1 
(^ro-6ioA)o^ indicates substitution ot cro-bioA with let, 
b p fiAD represents the promoter of araBAD- 
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. Strain DY329 was constructed by transduction of ZH1 141 with PI phage grown 
on WJW23, selecting for nadAMO tetracycline resistance (Tet R ) at 32°C and then 
screening for the presence of a defective lambda prophage which causes temperature 
sensitive cell growth at 42 °C. Similar PI transduction was used to create other strains 
described in Table 2 using standard media, methods, and selections (Sambrook et al., 
Molecular Cloning: a Laboratory Manual, 2nd Ed., (c) 1989; Miller, Experiments m 
Molecular Genetics, Cold Spring Harbor Lab Press (c) 1972). The symbol o is used to 
indicate a replacement generated by homologous recombination. The symbol x 
indicates an insertion generated by homologous recombination. A deletion at the pomt 
of insertion is indicated in parenthesis following the inserted gene. The entire gal 
operon in HME40 is inverted (IN). 

To induce expression from the PL operon and prepare electroporation- 
competent cells, overnight cultures grown at 32°C from isolated colonies were diluted 
50-fold in LB medium and were grown at 32°C with shaking to an OD 600 of about 
0 4-0.8. Induction was performed on a 10 ml culture in a baffled conical flask (50 ml) 
by placing the flask in a water bath at 42°C with shaking (200 revolutions/min) for 15 
min. Immediately after the 15 min induction, the flask was swirled in an ice water 
slurry to cool for 10 min. An uninduced control culture, maintained at 32°C 
throughout, was also placed into the ice slurry. The cooled 10 ml cultures were 
centrifuged for 8 min at 5,500 x g at 4°C. Each cell pellet was suspended in 1 ml of 
ice-cold sterile water, transferred to a 1.5 ml plastic microcentrifuge tube, and was spun 
for 20 seconds at 4°C at maximum speed in a microcentrifuge. After washing the cell 
pellets as described two more times, the cells were suspended in 100 ul of ice cold 
sterile water. This volume of competent cells is sufficient for two standard 
electroporation reactions (~10 8 cells per reaction). Larger cultures canbe prepared for a 
greater number of reactions or for storage of electrocompetent cells at-80°C with 12% 
glycerol present. Fresh competent cells give highest efficiencies of recombination .To 
transform cells by electroporation, purified linear donor DNA (1 to 1 0 ul) was mixed 
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with competent cells in a final volume of 50 ul on ice, and then pipetted into a pre- 
cooled electroporation cuvette (0. 1 cm). The amount of donor DNA used per reaction 
(usually 1 to 1 00 ng) is indicated for relevant experiments. Electroporation was 
performed using a Bio-Rad Gene Falser set at 1.8 kV, 25 uF with Pulse controller of 
5 200 ohms. Two protocols have been used interchangeably to allow segregation of 
recombinant from parental chromosomes within the electroporated cells. In both 
protocols, the electroporated cells were immediately diluted with 1 ml of LB medium. 
In one, the cells were incubated for 1 to 2 hours at 32°C before selecting for 
recombinants. In the other, the cells were immediately diluted and spread on sterile 
10 nitrocelluose filters (100 mm) on LB agar. After a 2 hour incubation at 32°C, the filters 
were transferred to the appropriate agar plates required to select for recombinants. 
Aliquots were also directly spread on LB agar and incubated at 32°C to determine and 
examine total viable cells after electroporation. For drug resistant selection, each ml of 
LB medium contained 10 ug of chloramphenicol, 12.5 ug of tetracycline, 20 ug of 
1 5 kanamycin, 30 ug of ampicillin, or 50g of spectinomycin. 

Although recombinants were verified by more than one method, the primary 
detection was for an altered phenotype caused by the modified target gene. Disruption 
or mutation of the galK gene was confirmed by the presence of white colonies on 
MacConkey galactose indicator agar, disruption of the rnc gene for the 
20 endoribonuclease RNaseDI was confirmed by the inability of lambdoid type phage to 
lysogenize (Court, pp. 71-116 in Belasco et al., eds. Control of Messenger RNA 
Stability, (c) 1993, Academic Press, New York), and deletion of gam, HI, and cIII in the 
PLoperon was scored as an ability of the X lysogen to survive growth at 42°C (Court 
and Oppenheim; Greer, Virology 66: 589-604, 1975). PCR analysis was used to 
25 confirmthealteredstructurecausedbyreplacementofagene. Southern hybridization 
analyses of parental and recombinant DNAs confirmed structural changes, and DNA 
from the recombinant clones can be amplified by PCR and sequenced. 

In addition to electroporation, any suitable method for macromolecular transfer 
into bacterial cells would be effective for practicing the methods herein disclosed. For 
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example, such methods tnay include exposure ,o divalent cations, DMSO and the like as 
described in a variety of standard laboratory publications, such as Sambrook et al. (see 
particularly pages 1 .74-1.84) and Ausubel e. ah, eds., Current Protocols In Molecular 
Biology, John Wiley & Sons (e) 1998 (hereinafter Ausubel et ah), herein incorporated 
5 in their entirety. 

EXAMPLE 3 

Homologous recombination with short linear DNA fragments 

The recombination system described in Examples 1 and 2 were used to generate 
10 a single bp mutation in the bacterial galK gene. 

The galK gene encodes a galactokinase that phosphorylates galactose and its 
derivatives. The galK galactokinase phosphorylates 2-deoxygalactose to generate 2- 
deoxygalactose phosphate (2DGP). While unphosphorylated 2-deoxy g alactose has no 
impact on cell growth, 2DGP is a nonmetabolized sugar phosphate that inhabits cell 
15 growth. Thus, cells containing a wild-type galK gene grow poorly on 2- 

deoxygalactose, a phenotype referred to herein as Gal + . In contrast, mutants defective 
in g alK grow well in the presence of 2-deoxygalactose (Dog), and have a phenotype 
referred to herein as DogR - (Adhya, pages 1503-1512 in Escherechia coh and , 
Salmonella typhimiirium: Cellular and Molecular Biology, Neihardt et al. eds., 
20 American Society of Microbiology, 1987). The DogR-phenotype enables ready 
selection of cells harboring successful recombination events. 

Complementary 70 base pair oligonucleotides were synthesized and annealed to 
each other. The annealed DNA was homologous to an internal coding segment of the 
bacterial galK gene, except that a UAU codon (TYR-145) was changed to a UAG 
25 amber codon. A homologous recombination event between this 70 base pair DNA 

fragment and the galK gene introduces a premature stop codon in the galK gene, and is 
referred to herein as the galK amber mutation. This mutation produces a truncated galK 
gene product that lacks function. 

The 70-bp DNA fragment was transferred by electroporation into galK+ cells 
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(HME5) mai nau ucui 

1 5 min After electroporation with the mutant DNA (1 00 ng) or mock electroporafcon 
without DNA, the cells were spread on minimal O.40/0 glycerol agar medmm wifh 0.2% 
2-deoxygalactose present. Spontaneous resistant mutants occurred frequently (10 ) » 
the absence of mutant DNA. Despite this, the addition of the mutant DNA enhanced 
the frequency of resistant mutants dramatically, generating one mutant per 500 
electroporated cells. 

To determine that temperature induction was required, another batch of cells that 
had no. been induced for recombination function was tested in the same way. In due 
, treatment.nodisceotableeffeotofaddedmumntDNAwasobserved. This indicated 

that both induction of PL operon expresaion and mutant DNA addition were reared 
for the enhanced survival. Wimontwishuts.obeboundbyasmglee.planahouof.he 
observed effects, i. ia believed mat the expressed .ambda functions allowed for efficient 
recombination of this short lirtear mutant DNA with the chromosomal g alK gene. 

5 Colonies surviving 2-deoxygalactose treatment were screened for their Gal 

phenotype on indicator plates, and aU tested had the Gal - phototype expected for a 
g alK amber mutant. To teat specifiealiy whether me g alK amber mutation was present, 
four independent Gal- colonies ware tested by tranaduoing cultures of each wtth a 

Phage mat caniea the «NA W suppressor allele s^F. The four mutants 

20 testedweresuppressedfoaGa.+ phenotypo.Finally.tbepreaenoeoftheamber 

mutation in galK was verified by PGR amplification and sequence ana.yaia of the g alK 
gene segment from the chromosome. 

This example demonstrates that controlled expression of lambda recombination 
genes from a defective lambda prophage promotes surprisingly efficient homologous 
25 re conrhinationinbac,erialoem,evenwifoabo rt lmeMaegmen B ofDNAhavingvery 

short homology arms. 

importantly, the high recombination frequency indicates that recombinants were 
identified without the need to apply positive or negative selection methods. DNA 
hybridization probes are thus designed to detect point mutations, deletions, inserts or 
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other modifications of cellular DNA. Standard colony hybridization or m sthr 
hybridization techniques oan be used ,o detect cells in which racotnbinatron has 
occurred. Alternatively, enriohnrent methods for mutation detection are used, 
particularly for detecting pour, mutations. Such enrichment methods are descrtbed m 
, ^letl.An.a.oftheNewyortcAoademyofScienoes^^-B^OOOw^hts 

herem incorporated by reference in its entirety. One example of a suitable enrtchmen, 
■rmismlh ampliation mutation assay described by Cha e, ah, PC* Appncattons 
and Medrods 2: 14-20, 1992, herein incorporated by reference in its entirety. 
The fact that the DNA fragments with abort homology arms are able to 
0 recombhre in vivo opens a vast array of new possibffities for generating recombman. 
DNA Sever.stepsnorrnaUymvolvedmgeneratingreoomhmantDNAnro.eoulesam 

ehminated. Restriction enzyme digests are not remaned to generate DNA fronts, 
and DNA tigase reactions are not required to join different DNA fragments at novel 
actions. The cell genem.es me completed recombinant precisely joined through 
1 5 homologous recombination. 

The efficiency of recombination approaches 0.1% of surviving cells from a 
stmdar d eUctroporation. At mis efficiency, nnselected colonies could be screened for 
recombinant DNA using colony hybridization techniques, euminating the need for 
selection steps. Thus, mis recombination protoco, makes the bacterial cbromosomeand 
20 plasmid DNA amenable to ahnos, any type of desfred change. This tncludes dtrected 
mutagenesis of a gene, a gene segment, or even a base. 



25 



EXAMPLE 4 

Preparation of linear DNA cassettes greater than 1000 bp in length 

Standard Polymerase Chain Reaction (PCR) conditions were used to amplify 
hnear DNA fragments wim me Expand™ High Fidelity PCR system of Boehrtnger 
Mannheim. The chloramphenicol resistant (Cm*) cassette ca, 
pPCR-Scrip, Cam (Stratagene) wimprimem 5'TGTGACGGAAGATCACTTCG (SEQ 
m NO- 1) and 5'ACCAGCAATAGACATAAGCG (SEQ D> NO:2).. Tbe tetracychne 
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resistant (Tet*) cassette tet was amplified from Tnl 0 with primers 
5'CTCTTGGGTTATCAAGAGGG (SEQ ID N0:3) and 

5'ACTCGACATCTTGGTTACCG (SEQ ID N0:4). The ampicillin resistant (Ap R ) 
cassette amp was amplified from pBluescript (Stratagene) with primers 
5'CATTCAAATATGTATCCGCTC (SEQ ID N0:5) and 

5 AGAGTTGGTAGCTCTTGATC (SEQ ED N0:6). The kanamycin resistant cassette 
kan was amplified from Tn5 with primers 5 'T ATGG AC AGC AAGCG AACCG (SEQ ID 
NO-7) and 5TCAGAAGAACTCGTCAAGAAG (SEQ ID N0:8). PGR products were 
purified using Qiagen PCR purification kits and concentrated if necessary by ethanol 
) precipitation. The amplified linear DNAs were suspended in sterile water or TE buffer 
(lOmM Tris-Cl P H7.5; ImM EDTA) and quantified by spectroscopy. DNA in water 
was stored at -20°C. The inventors avoided PCR product purification schemes from 
gels in which the DNA is subject to ultraviolet irradiation. 

In order to design primers for amplification of a recombination cassette, 
5 recombinant oligonucleotides were chemically synthesized with the 5' 30 to 50 

nucleotides identical to sequences at the target nucleic acid sequence, and with the 3' 20 
nucleotides homologous to the ends of the cassette to be introduced. A cassette is 
generated by PCR that is flanked by the 30 to 50 base homologies present at the target. 
Cells carrying the target DNA either on the chromosome or on a plasmid are 
70 induced for Exo, Beta and Gam function. These cells are made competent for 
electroporation and mixed with the amplified cassette. Following electroporahon, 
recombination occurs between the homologous sequences on the linear cassette and the 
target replaces the target segment with the cassette. 

The 50 ntga/Khomology segments (rectangles) used for the experiment 

25 described in Table 3 are: 

5'GTTTGCGCGCAGTCAGCGATATCCATTTTCGCGAATCCGGAGTGTAAGAA 

(SEQ ID NO: 9) and 

5-TTCATATTGTTCAGCGACAGCTTGCTGTACGGCAGGCACCAGCTCTTCCG 
(SEQ ID NO: 10) 
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h one embodiment, «- — * > *« ^ " * ^ 
if ta «e, serptence in the steps can be counteracted. The transcnp on 

^lg replaced. Tee primers contain two par*: a , end echoes to fla-ng 
5 ^o,«DNA,anda3- 

The PGR using these primers and a DNA template containing the D ^ Wt ^ 

^formation with the template DNA wil, generate me seieeted phenotype for 

10 DNA can be destroyed by rreahnen. with Dpn! Mowing me PC* Dp* 

methylated GATC temp.ate DNA iea^g me new.y repUcated unmemy, a*d DNA 
Ju Onee a Imear cassette has been generated, i, can be stored and nsed as the 
template for subsequent PCRs. 

EXAMPLE 5 

15 Gene replacement by targeted homologous recombination 

, ..^ftu. linear DNA can direct mutations 
Having demonstrated in Example 3 that 70-bphnearDiNAC 

. anerirDNAhavmgSO-bpga^DNAsegmentsflankmgthecfli 
to a specific target, a synthetic DNA navmg ->u ps aatrnU Co eae 

, casS ette was constructed for targetmg a gaK gene 

(chloramphenicol resistance, or Cm ) cassette 

replacement by cat. 

, 0 The linear cat cassette with flanking gnlKDNA was made by PCR nsing 

chemioally synthesized primers, as described in Example 4. 

Dato from ttaese experiment are presented in Table 3. DY330 competent cells 
we re operated wim ,00 ng of me - cassette targeted ,o 

25 lhage g aes). Tola! reeombmants per electroporauon are show, m me nghmrost 
^"CmRrecombinan," The en, cassette was transferred by "» 
» K+ ecus which had eimer been heat-indnoed for PL operon ^on 15 mm 
Ceranne ah* to 4rC, as indicated by IT in me eenter cohnnn of Table 3, or no, 
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md »ced (maintained at 32°C; indicated by "0" in the eenter column of Tab.e 3). See 
mou v . . . j „,w methods After electroporanon, 

also Example 2 for description of mducuon and other methods. Alt 

Cm" recombinants were selected at 32°C, and then qoanhfied. 

As shown in Tahie 3, Cm* cotomes were only found in me heat-induced culmre. 
AD 50 Cm*co,onies tested had a Gal- phenotype on MacCookey galactose md.ca.or 
Z — the presence of the ^o- replacement The * 
dement generated hy homologous re—ion .eahmm.es, or examp. 

'.Koca, indicates ma. me bacterial gm* gene is replaced by eat usmg homologous 
recombination techniques. 

Table 3 



■ w 42X,min Cm R Recombinants 
Target Site* 



<1 

4 



GalK 1 2.5x10' 
GaZK 

CMkilgam 0 

15 5.0x10 

CIU Ml gam 



<1 

4 



15 



20 



m staiiar experiments using the same 50-bp homologous arms,thegaKbas 
heen exchanged, or to , nt-tp. and cassettes by selector Km ,Amp andTe, , 
ceahngg^oto, ■**»«* andga/ K <>.«teplaoements,respea,,vel y . 

T„ .estwhether dais approach also works a. oppositions onthebaetenal 

taneriiatelynpstraamrmd downstream of the « gene encodmg — ^™ 
s ene is thought to be non-essential (TaBff et al„ Journal of Bac.eno.ogy 171 . 2581 
Z 19S9) merefore, 1, was tested whether an exact subsuhrrion of the codmg 

U - - -*« 10 COd ° n ^ 06 ^ Td t me 

Xnhinauon Uchm,nes herein described. In mis construe, - is 
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Following procedures used for galK as described in Examples 2 and 3, Cm 
octanes were found, bur only in to induced culture. The Cm* colonies tested had a 
Rnc mutant phenotype, as described in Example 2. 

Two other rncocai recombinants were made. One replaced sequence from the 
AUG start to oodon 126 dm. and the other from the AUG star, to codon 192 of rnc. 
These two recombinants generate cat,mc B en. fusions with an mc mutant phenotype. 
Different se* of primers were chosen to detect unambiguously the wild-type and/or 
recombinant alleles. This PCR procedure follows guidelines set forth by yeast 
researchers in characterizing chromosomal replacements in yeas. (Wmzeler e. aL 
Science 285: 901-906, 1989). The PGR analysis of the menmbinams venfled the loss 
the rnc* gene and the predicted stnrcmres of the tea mcoca, gene replacements. 

In this example, several different genes on the bacterial chromosome and on 
ptanids have been substituted with drug resistance marker, However, > alao 
possible to create recombinants in which the desired product does no. include a 
Lectable marker. Genes have been fused .o cassehea encoding specific .ags such as 
me green fluoresoen. prouh, Fusion <ags can be placed precisely m me gene ,0 be 
momfied, for example, by any of me following Regies. In one, me unaelec.ed 
cassene is joined to a seleCable dmg marker, and bom are rccombmcd in.o .he chosen 
,oca«ion selecfing for drug reside. In anofiaer, me oasseho is recomhmed mto , d. 
Nation by snbstimung i. for a negative selecfion marker tike : (BlomMd e, d, 
Mol Microbiol 5, 1447-57,199!). This strategy permits cloning of any DNA. mafinrd 
strategy, the recombinants are screened non-selectiveiy by DNA hybridization wrtb 
probes specific to the cassette. 

to ttrese experiment the desired recombination product was usually obtained. 
; However, same recombination products were onexpecfed. In tivo oases, an attempt was 
made .0 knock-on. essential genes, apd auxprisingly i. was posaible .0 selec. a few rare 
recombinants. These burned ou. to be diploid for the re^on of the targeted glance 
fcey carried the wild type and fire mutant copy of the gene as dctemuned by PCR 
analysis. Rare diploid regions of the bacterial chromosome am known to occur 
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• n= «t a freauencv of about 0.1% (Haack and Roth, Genetics 
spontaneously in growing cells at a frequency oiao -^ were 

14 1245-1252,1995). Because an essential gene was targeted, these rare daplo.ds were 
selected. This was only possible because the recombination is so efficient. 

This example demonstrates that the methods of this invention can be used to 
promote efficient gene replacement by homologous recombination. The gene 
eplacements were made throughout the bacterial chromosome. In Example 7 

plasmids, bacterial artificial chromosomes, cosmids, phagemids, and the like. 

EXAMPLE 6 
Induction time, DNA amount, and 
homology arm length affect targeting efficiency 

Inducti on time. FIG. 3 shows the effect of mduction time on — ^ 
The strain DY330 was grown at 32°C to OD^O.4 to 0.8. heat-induced at 42 C forthe 

• L indicated and then made electrocompetent (see Example 2 for descnption of 
timesmmc n ftllt ,was used to target the d/JH gam genes of the 

methods). A linear cat cassette (10 ng) was used to g 
prophage. Total Cm* recombinants were plotted vs. the time of induction, 
induction of PL operon expression for only five minutes enhanced 

• Ptr 3 reveals that by 7.5 min of heat induction a maximum 

recombination activity. FIG. 3 reveais mdi 

of the PL operon for longer than 60 min causes cell death. 

Cells harhoring the defective lamhda prophage may be grown a. temperatures 
other lhan 3 2 »C. fc genera!, i, is undesirable to gro W cells a, 1»P— ^ 
„ hecause suoh .emperahues lead to partial inaouvalion of me d repressor, and * 
expression fiom the PL promoter, m genera,, « ia also turdeairable !o gmw cefc a 
expression u r m the art would also 

temperatures below 20°C, because of slow growth. One skilled in 
J^ematmereisaconsiderabledegreeoflatimdewith^^ 
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tempm hn. of induction. For exaunale, expression of —6a recotnbmahon genee 
from the PL operorr could be Mooed a. temperatures as low as 38 «C, generally 
snowing for longer times of induction. The limit for protein expression m K cok u 
about 45°C. 

5 DcorUU ««* FIG.4shows«he.ffec<ofamountofft«lmear D NA 

cass ette on recombmation. The attain DY330 was grown a, 32°C to OD^O, , 0. 
mdnced a, 42°C for 15 min and tiren made e.ectrocompetent. Different amounts 1, R 
tOO, 300, 1,000 ng) of a hnear - cassette (1 hbp in length) were used to ^rge, the 
U g an, genes of the prophage. Total Cm* recombinants were plotted vs. the DNA 

1 0 amount at 42°C. 

a- increased in a near linear relationship with 

Fig. 4 shows that targeting efficiency increased in an 

.ereasm annexion of donor OKA m the range from loP d ng, to .0 C> 
mo.ecu.espere.eotropomtton. Aaattnanngleve.oflmearWAtsreaehed^xtO 
mo.ecu.es vie)mn g 7.5xl0< recombinants per~2x!0« ceUs olecttoporated. ThuMhe 
« methods of to invention maybe pracdeed ovee ahroad rang, of ohgonnoleottde 
concentrations. 

DMV <** HG 5 shows me effect of homologous arm lengd, on 
r ecombi.a«on.ThesttainsDY330(^ + J fl medchele S ) m dDY33 1 (^-Xopen 

.holes) were grow, at 32°C, indnced a, 42°C for 15 min and thenmad. 
20 e.ectrocompetent. A nneaa^r cassette (lOOng) was naedto target me cf^ «£. 

g en K ofttr.prophage. Th«homo,ogooaam ll engmofd>eoaaaettewaavanedttomO,o 

L . . „,unt n <vf, Vin homologies were chemically 
1 000 bp. The primers containing the 0 to 50 bp nomoiogi 

resized as described (Fi, 2). The casaette containing 1,000 hp homologous a»s 
was made by FCR using primers 1,000 bp away on each side of an extsttng (cBH 
25 g^oco, disruption in the eel). Total Cm* recombmants were plotted vs. the 
homologous arm length. 

Several pahs of primers were made to amplify ft. cai cassette for targeting lb. 

W*h of da. homology secant on me prmreca varied by incrementa o, 10 baaes from 
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10 to 50 bases. A nested set of linear cat cassettes was made with the primers. Another 
linear cat cassette was constructed flanked by 1,000 bp of homology. This set of hnear 
DNAs was then tested for recombinational targeting efficiency as shown in FIG. 5 . No 
recombmants were found with 10 bp of homology and less than ten recombinants were 

5 found in each ofthree experiments with 20 bp of homology. From 20 bp to 40 bp of 
homology, homologous recombination increased by four orders of magnitude. From 40 
bp to 1,000 bp of homology, recombination increased only 10-fold. 

These data indicate that the methods herein disclosed may be practiced with 
surprisingly short homology arms, as few as 20-40 residues. However, homology arms 

10 of 30 or greater residues increase efficiency. 



15 



20 



25 



EXAMPLE 7 

Gene replacement on plasmids and BACs: in vivo cloning 
To determine if this method could be used to modify plasmid DNA, the 
procedures described in Examples 2-3 were followed to modify plasmid P GB2, a 
derivative of pSClOl (Bernardi et al., Nucleic Acids Res. 12: 9415-9426, 1984). A cat 
cassette was synthesized in vitro and recombined in vivo with P GB2 to replace the 
spectinomycin resistance gene with cat conferring Cm* on the cell carrying the 
recombinant plasmid. 

The same experiment, performed on pBR322 derivatives, generated 
recombinants, but they were joined in tandem to non-recombinant plasmids as donas 
and higher multimers. Induction of Gam expression from our prophage macttvates 
RecBCD nuclease. In the absence of RecBCD, P BR322 derivatives replicate by a 
rolling circle mode (Feiss et al, Gene 17: 123-130, 1982), and the plasmid converts 
from monomers to multimers. This is specific for pBR322-type replicons as the pGB2 
type did not foim multimers. 

To generate simple recombinants of pBR322 derivatives, the protocol was 
xnodified by coelectroporating the recA' strain DY331 with circular plasmid DNA (0.1 
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ng ) and a linear drug cassette. Recombinant plasmid monomers were readily selected 
and isolated. 

In addition to plasmids, the method is also suitable for targeting genes on 
bacterial artificial chromosomes, phagemid artificial chromosomes, yeast arhficral 
chromosomes, cosmids, and the like. Homologous recombination between target 
nucleic acid sequences on BACs and synthetic oligonucleotides (such as ds DNA 
fragments or PGR fragments) is carried out in bacterial cells bearing the defective 
lambda prophage shown in FIG. 1 and described in Example 1. Synthe tl c 
oligonucleotides (such as short annealed dsDNA fragments or PGR fragments) are 
electroporated into bacterial cells as described in Example 2. Analysis for successful 
recombination events is by selective PGR amplification using specific primers for the 
introduced sequence, or by selective amplification approaches snch as the mismatch 
amplification mutation assay described by Cha et al., PGR Applications and Methods 2: 
14 20 1992orbydirecthybridizationusingspecific P robes. Using tins approach 
recombination frequencies of up to 1 :500 have been observed, regardless of me strand 
targeted. Thus, mis system is extremely useful in manipulation of and rapid screemng 
for recombinants in BAG vectors. The unexpectedly high efficiencies eliminate the 
need for introducing selectable markers, or modifying such markers on the BAG. 



20 



EXAMPLE 8 
Requirement of RecA for targeted recombination 

The requirement for RecA were tested in targeted homologous recombination by 
repeatingtheexperimentdescribedinFig.Sbutusinga^- strain. Surpnsmgly, 
recombination efficiency was depressed only about 10-fold in the recA- mutant for the 
25 arm lengths tested (Fig. 5). Thus, RecA function is not required, and recA- strams 
m ediate efficient homologous recombination with linear DNA fragments having 
homology arms of 30 bp or greater. 

This result was unexpected. The Recombination system is known to function 
in cells lacking the bacterial RecA function (Brooks and Clark, J. Virol. 1: 283-293, 
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1967) However, the recombination in recA mutants under coupons used by outers 
Hied more than 50-fold relative to ieveis in ^ ceils (Stahl et a,, Genetics 77. 
395-408, 1974; Murphy, J. Bacterid. 180: 2063-2071, 1998). 

EXAMPLE 9 
Lambda genes promote targeted recombination 
of dsDNA in wild type E. coli 

To determine which .ambda genes promote targeted recombination of dsDNA, a 

j ■ fVit» pt nneron of the prophage using cat 
se , of replacement deletions wore generated m the PL operon ot P S 

a»d J—, m the center clnmn of Tahle 4, the patent* 
deletioLadehydterecombinationeventwidnndreprophage (refer toFrg. 1 fora 

deletions m deletions was verified 

Uneax map of the prophage genes). Each of these newly 

stractuI auy byPCR^ 
' into galK. 

El „pe,en, cells from sfrains mdieated in Tahle 4 were hearttdnoe * 
rented as total nnmbcr of Tet* recombn-s per eleofroporahon . the ngh, hand 



column of Table 4. 



Table 4 
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Strain 



DY330 
DY392 
DY351 
DY386 

DY349 



Prophage 



wild-type 
(hin-int)oamp 
(sieB-kil)ocat 
(hin-int)oamp 
(sieB-kiI)Ocat 
(gam)ocat 



Recombinant 




4,100 
2,000 
4,400 
1,650 

0 
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DY360 (bet)ocat 0 

DY359 (exo)ocat 



Table 4 shows that only exo, bet, and gam deletions affected galKotet targeted 
recombrnation. Deletion of any one of these three genes eliminated the recombrnatron, 
whereas deletion of all other genes in the P L operonhad little if any effect. 

To show that the gamocat substitution was not polar on bet and exo, the gam 
gene was expressed in trans and shown to complement the defect. Thus, although the 
entire PL operon was used in studies described here, only exo bet and gam funcuons are 
needed for recombination with double-stranded DNA cassettes >100 -200 bp made by 

PCR. 
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EXAMPLE 10 
Efficient Recombination with Single-Strand DNA 

To evaluate recombination between exogenous single-strand DNA and the E. 
coli chromosome, the experiments described in this Example were performed. In 
1 5 addition, the experiments also address the role of the bacterial recA recombmahon 
genes in mediating the observed effects. 

Expression from the PL operon was heat-induced, cells were made 
electroporation-competent, and 70-mer oligonucleotides were electroporated into cells, 
all as described in Example 2. Strains HME9 and HME10 both harbor the galK *nber 
mutation, and are therefore Gal- in phenotype. They differ from each other in that 
HME9 is recA + , whereas HME10 is recA-. Thus in this experiment recombmaton 
efficiencies in recA+ and recA-cells are compared. 

The 70-mer single-stranded oligonucleotides used in this experiment were 
designed to restore wild-type galK gene activity (hereinafter galK + ) upon successful 
25 recombination, thereby producing a Gal+ phenotype (i.e., ability to grow onmmunal 
media with galactose as the sole carbon source). The 70-mer corresponding to the 
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transcriptional non-template DNA strand of galK was 

5'AAGTCGCGGTCGGAACCGTATTGCAGCAGCTTTATCATCTGCCGCTGGAC 
GGCGCACAAATCGCGCTTAA (SEQ ID NO:ll). 

70-mer single-stranded DNA of either SEQ ID NO: 1 1 or its complement was 
5 electroporated into cells. Alternatively, the two 70-mers were first annealed to each 
other and then electroporated into cells as double strand DNA. A successful 
recombination event was identified by restoration of the Gal+ phenotype. Table 5 
indicates the number of galK+ recombinants per viable cell, x 10" 4 . 

Table 5 presents the number of recombinants observed x 1 (T after 
10 electroporation of the HME9 or HME10 strains with DNA in the indicated forms. 
Efficient recombination was observed with double strand DNA, similar to that 
previously described in Examples 3 and 5. Surprisingly, single-strand DNA was about 
equally or even more efficient than double-stranded DNA in producing homologous 
recombination, regardless of strand used. In this experiment, recombination efficiencies 
15 for the double-stranded DNA was about 1 in 3800 cells (2.6 - 2.7 x 10^), whereas 
recombination efficiency for single-stranded counterclockwise DNA was 3-to 7-fold 
higher. In other strains (e.g DY374) recombination of single strand linear DNA as 
frequent as one nadA + recombinant per 20 viable cells has been observed. In addition, 
efficient recombination was observed in both recA+ and recA- cells, establishing that 
20 the recombination events did not require the bacterial recA gene products. 
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Table 5 
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Strain Used 



DNA used in electroporation 
dsDNA cc-ssDNA cw-ssDNA 

(about lug) (about 0.7 ug)(about 0.6 fig) 



HME9 
galK am XCI857A(cro-bio) 

10 HME10 

g alK am X,CI857A(cro-bio) recA' 



2.6 



18.5 



2.2 



2.7 



0.35 



cc=clockwise (strand with the 5' to V orientation relative to transcription) 
cw=counterclockwise (strand with the 3' to 5' orientation) 



Efficient Generation 



EXAMPLE 11 

ion of Large Deletions by Recombination with Single-strand 

DNA 



Example 10 demonstrates efficient lambda-mediated recombination using 
ssDNA to generate a single base change in the E. coli chromosome. This example 
demonstrates similar high efficiency when the approach is used to generate large 
deletions. 

Using the methods described in Examples 2, 3, and 10, lambda operon 
expression was beat-Induced, cells were rendered electtoporation-competent, and 70- 

_gle-stranded DNA of either SEQ IDNO:ll or its complement was 
e.ectroporatedintocehs. The DNA was electroporated into two strains: onocontammg 
,he galK amber mutation, and one in which ihe g aK gene was interrupted by a cawcB 
cassette precisely inserted at the position of the amber mutation in S alK. The strams 
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were otherwise genetically identical. A successful recombination event was identified 
byrestorationoftheGal+phenotype. Table 6 indicates the number of galK + 
recombinants per viable cell, x 10 . 

The data in Table 6 demonstrate that recombination efficiency is sirmlar using 
the same oligonucleotide, regardless of whether lambda-mediated ssDNA 
recombination is being used to generate a single base change or a large deleton, 
removal of the 3264 bp cat-sacB cassette. In both cases, the method is highly effluent 
in generating recombinants. 



10 



Table 6 



25 



Strain Used 



DNA used in electroporation 
cc-ssDNA cw-ssDNA 
(200 ng) ( 20 ° n 8) 



15 

HME6 
galK am ^CI8 5 7 A(cro-bio) 

HME31 

20 galKocatsacB XCI857A(cro-bio) 



15 



10 



0.5 



The length that is reasonable to synthesize chemically limits the length of smgle 
strand oligonucleotides used for recombination. It is demonstrated herein that two 
oligonucleotides that have a complementary overlap region at their V ends, when co- 
electroporated into DY41 1 cells, can anneal and generate recombinants wrth 
chromosomal or extrachromosomal DNA (Figure 14). This recombination requxres the 
induction of the PL operon and the Gam, Beta, and Exo functions. A galK mutatton m 

created by recombineering. Two oligos were synthesized that were 70 bases long, wrth 



WO 02/14495 



PCT/USOl/25507 



60 



34 bases of the deleted region at their 3' ends, that were complementary and can act to 
anneal the two oligos together. The 5' end of each oligo contained 36 bases of 
homology to each side of the 34bp deletion caused by kan. Each oligo alone cannot 
generate gal+ recombinants but mixed together they generated up to 10 5 recombinants 
5 per 1 0 8 cells electroporated. Oligos with the same sequence but shortened from their 3' 
end to overlap by only 2 bases did not yield recombinants. However, overlaps of 10 
bases or more generated recombinants. Preannealing was not required and the two 
oligos can be mixed and used directly for electroporation. 

If the ends of the overlaps are filled in by DNA polymerase a 104 bp duplex is 
10 generated. This dsDNA generates only slightly more recombinants then the DNA with 
10 to 34 base overhangs. Thus, multiple overlapping (by >10 bases) oligonucleotides 
of any even number can be used to yeild recombinants, in which the most outside 
oligonucleotides have 5' overlaps. The end oligonucleotides also have 30-50 bases of 
homology to the targeted region. The use of multiple overlapping oligonucleotides 
1 5 allows production of long recombination substrates without use of PCR. The central 
oligonucleotide(s) can be any cassette envisioned to be used for dsDNA recombination. 
This recombination with overlapping oligonucleotides having outside 5' overhangs is 
most efficient with Exo, Beta, and Gam, but can be recombined by Beta alone (without 
Exo and Gam) in the cell. This greatly simplifies the recombination procedure (as only 
20 Beta is required). Although the 1,04 bp duplex DNA recombines more efficiently if 

Exo, Beta and Gam are present, recombination also occurs in the absence of Gam, albeit 
at a lower efficiency (the duplex requires both Exo and Beta for recobination). 

Similar overlapping synthetic oligonucleotides can be generated with 3' 
overhangs of 34 bases that can be coelectroporated into cells. These are also 
25 recombined into targets defined by homology at the ends. Again, only Beta is required 
for this recombination. In this case, Exo is not required, and further Exo does not 
stimulate recombination. In one embodiment, multiple oligonucleotides can be 
overlapped as above to span longer distance. As long as the outermost oligonucleotides 
have 3' overhangs, recombination will be Exo independent. The efficiencies of the 
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present system allows the detection of recombinants in this case. 

Examples 10 and 11, taken together, document that the methods disclosed herein 
can be practiced with ssDNA oligonucleotides. This surprising result enables high 
efficiency homologous recombination with synthetic DNA of single or double 
strandedness. 

The present system allows the limit of synthetic oligonucleotide size to be 
increased dramatically by overlapping oligonucleotides. In addition, the system allows 
recombination of these DNAs to be carried out with Beta alone, or Exo with Beta but 
wi thout Gam. Recombination without the requirement for Gam is important because 
Gam is the toxic function that was a limiting factor in the previously described methods. 
As the present system requires only Beta, a constitutive promoter can be utihzed. 

EXAMPLE 12 
Effect of ssDNA Length on Recombination Efficiency 

in Examples 10 and 11, lambda-mediated recombination was used to efficiently 
incorporate 70-mer ssDNAs into the * coli chromosome. In this example, the effect of 
oligonucleotide length on recombination efficiency was investigated. 

Using the methods described in Examples 2, 3, 5, 10 and 1 1 , lambda operon 
expression was heat-induced, cells were rendered electroporation-competent, and 
ssDNA oligonucleotides (200 n g each) were electroporated into E. coli HME9 stram 
cells The electroporated ssDNA oligonucleotides included the 70-mer of SEQ ID 
NO-11 a 60-mer constructed by removing the last 5 nucleotides from both the 5' and 3' 
ends of SEQ ID NO:ll, and a 50-mer, 40-mer, 30-mer, or 20-mer constructed by 
removing the last 10, 1 5, 20, or 25 nucleotides, respectively, from both the 5' and 3 
ends of SEQ ID NO.ll. As in example 10, the ssDNA oligonucleotides used m this 
experiment were all designed to restore the galK + gene upon successful recombrnauon, 
thereby conferring upon the cell a Gal + phenotype. Table 7 indicates the number of 

-4 

gaLK+ recombinants per viable cell, x 10 . 
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Table 7 



10 



15 



Oligonucleotide length 
0 20 30 40 50 60 70 



5 Efficiency 0.004 0.01 



0.47 



22 



(xlO" 4 ) 



20 



As the data in Table 7 demonstrate, recombination efficiency increases with 
increasing ssDNA length. Recombination efficiency was low when the ssDNA used 
was a 20-mer, but increased considerably with a 30-mer. Efficiency was near optnnal 
with a 40-mer, and increased to 1 in 450 viable cells with a 70-me, Hence, specffic 
examples of the invention use single-stranded DNA molecules at least about 40 
nucleotides in length. 

Without wishing to be bound by a single explanation of the observed effects, the 
inventors currently believe that observed length-efficiency relationship may reflect 
published data indicating that lambda Beta protein binds stably to DNA sequences 
bases or longer, but does not bind as well to shorter oligonucleotides (Mythm et al„ 
Gene 182: 81-87, 1996). 

EXAMPLE 13 

Lambda Beta Protein Mediates Efficient Recombination with ssDNA 

To determine whether lambda Beta protein was sufficient to mediate 
recombination between exogenous ssDNA and the*, coli chromosome, the efficiency- 
of recombination was investigated in a strain that expressed lambda Beta, but not Exo 
25 or Gam. 

For these experiments, the HME43 strain was used. Its genotype is identical to 
the HME6 strain, except that the lambda prophage contains additional genetic deleuons, 
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from int through exo and from gam through N (see FIG. 1). In addition, the cat gene 
conferring the CmR phenotype is inserted between attL and bet. 

Using the methods described in Examples 2, 3, and 10-12, expression of the 
modified lambda operon was heat-induced, cells were rendered electroporation- 
competent, and 70-mer ssDNA ofSEQ ED NO: 1 1 (200 ng) was electroporated into 
cells. Using this procedure, the HME43 strain expresses lambda Beta protein, but does 
not express gam, exo, or any.other prophage encoded genes. A successful 
recombination event was identified by restoration of the Gal+ phenotype. Table 8 
indicates the number of galK+ recombinants per viable cell, x 10 -4 . 

Table 8 



15 



Strain Prophage Modifications Recombination Efficiency (x 10" 4 ) 



HME43 (int-exo)ocat, (gam-N)oA 



7.7 



20 



25 



In contrast to Example 9 using PCR-generated double-stranded DNA, the data 
presented in this example establish that lambda Beta alone is sufficient to mediate 
efficient recombination between ssDNA and the E. coli chromosome. Moreover, two or 
more overlapping oligonucleotides may be used, if they have a 3' overhang and more 
than about 10 bp of overlap. Overlapping oligonucleotides with a 5' overhang also 
promote homologous recombination with Beta alone. However, for 5' overhangs, exo 
and gam (or a similar exonuclease and RecBCD-inhibition function) appear to enhance 
maximal efficiency. 

A modification of the method is to place DNA encoding other ssDNA binding 
polypeptides under control of the PL promoter. For example, the strain HME43 is 
further modified to delete bet and insert DNA encoding P22 Erf, RecT, or Rad52. 
Expression of the ssDNA binding polypeptide is induced by temperature shift as it is for 
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induction of lambda bet expression. Exo and Gam, or proteins with similar function, 
may also be placed under control of the PL promoter. Moreover, other inducible or 
constituitive promoters maybe used. 



EXAMPLE 14 
Ex Vivo Combination of ssDNA With Lambda Beta 
Mediates Efficient Homologous Recombination 



I wanted you to know that this experiment has not worked yet. It is still a n 
10 experiment that has not yet been completed. 

Single-strand DNA can be combined with lambda Beta protein prior to 
electroporation into cells, and mediated efficient recombination between the ssDNA and 
the host DNA. 

Lambda Beta proteins may be prepared by techniques known in the art 
15 (Karakousis et at, J. Mol. Biol. 276: 721-731, 1998), and preincubated at 37 °C with 
single-strand oligonucleotides of 20-mer or greater length, hi this example, ssDNA 
oligonucleotides of SEQ ID NO: 1 1, a 60-mer constructed by removing the last 5 
nucleotides from both the 5' and 3' ends of SEQ ID NO: 11, and a 50-mer constructed 
by removing the last 10 nucleotides from both the 5' and 3' ends of SEQ ID NO: 1 1 
20 were used. Typically, lambda Beta protein concentration is about 2.5 uM and DNA 
concentration about 5 uM, but the method is effective with a broad range of protein and 
DNA concentrations (for example, from 0.1 \M to 10 mM protein, and 0.01 uM to 10 
mM ssDNA). Alternatively, the Beta protein and ssDNA can be coelectroporated into 
cells without premixture or preincubation. 
25 The DNA and protein is electoporated into E. coli using methods described in 

Examples 2 and 3. hi this example HME 43 strain is used, but numerous other strains 
are suitable. Expression of the modified lambda operon is one set of cells is heat- 
induced, and a second set of cells is maintained at 32 °C. Both sets of cells are rendered 
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dectroporation-competent, and 70-mer ssDNA of SEQ ID NO: 1 1 (200 ng) is 
electroplated into both heat-induced and uninduced cells. Using this procedure, the 
HME43 strain expresses lambda Beta protein upon temperature shift to 42°C, but does 
not express Beta from bet or any other prophage-encoded genes in the absence of a 
5 temperature shift. A successful recombination event is identified by restoration of the 
Gal+ phenotype. 

In this experiment, high efficiency recombination is observed in both heat- 
induced and uninduced cells. Moreover, it is believed that approximately equally high 

• efficiency recombination is observed when these techniques are followed in * coh 

1 0 strains that contain no lambda prophage genes. 

This approach may be modified by substituting other ssDNA binding 
polypeptides for lambda Beta, such as P 22 Erf, RecT and Rad52. The target nucleic 
acid sequence may be on the bacterial chromosome, or on exogenous DNA such as a 
bacterial artificial chromosome, phagemid artificial chromosome, plasmid, cosrmd, or 

15 the like. Moreover, there is no particular requirement for a specific bactenal specres; 
these single-strand DNA binding polypeptides will mediate efficient recombination m a 
broad range of bacteria. Indeed, these polypeptides will mediate efficient 
recombination in eukaryotic cells as well, as in Example 15. 



20 
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EXAMPLE 15 
Lambda Beta Protein Mediates Efficient 

Homologous Recombination in Eukaryotic Cells 

The ex vivo approach described in Example 14 may be used to target genes in 
eukaryotic cells for homologous recombination. In eukaryotic cells, transfection of the 
ssDNA with lambda Beta protein may be accomplished by electroporahon as in 
Examples 2, 3 and 14, or by the methods of Chang et al., Biochimica et Biophysica 
Acta, 153-160 (1992), Keating and Toneguzzo, Bone Marrow Purging and Processing. 
491-498 (1990), or other electroporation protocols known in the art. In addition, a 
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variety of means fox macromolecular transfer methods are known to the art, including 
calcium phosphate-DNA co-precipUation (Ausubel et al.), DEAE-dextran-mediated 
transfection (Matthews et al, Expenmental Hematology. 21 : 697-702, 1993) polybrene- 
m ediated transfection (Costello et al., Gene Therapy 7: 596-604, 2000), microinjecUon 
(Davis et al., Blood 95: 437-44, 2000), liposome fusion and lipofection (Vert et al., 
Cardiovascular Research. 43: 808-22, 1999), protoplast fusion (Schaffher, Proc. Natl. 
Acad Sci USA 77: 2163, 1980), inactivated adenovirus-mediated transfer (Wagner et 
al Proc Natl Acad Sci USA 89:6099-6103, 1992), hemagglutin virus of Japan- (re- 
mediated transfer (Morishita et al., Journal of Clinical Investigation 93 : 1458-1464, 
1994) biolistics (particle bombardment) and the like. Any such macromolecular 
transfer approach is suitable. Design ofdsDNA molecules for facilitating homologous 
recombination with eukaryotic genes is well known in the art (for example, as descnbed 
in Mansour, Nature 336: 348-352, 1988; Shesely, PNAS 88: 4294-4298, 1991; 
Capecchi, M. R, Trends in Genetics 5: 70-76, 1989; U.S. Patent No. 6,063,630). 

Cells to be transfected with exogenous DNA are combined with a DNA 
construct comprising the exogenous DNA, targeting DNA sequences and, optionally, 
DNA encoding one or more selectable markers. The resulting combination is treated m 
such a manner that the DNA construct enters the cells. This is accomplished by 
subjecting the combination to electroporation, microinjection, or other method of 
introducing DNA into vertebrate cells. Once in the cell, the exogenous ssDNA as 
integrated into cellular DNA by homologous recombination between DNA sequences m 
the DNA construct and DNA sequences in the cellular DNA. 

For example, the target nucleic acid is the beta-globin gene in hematopoietic 
stem cells from a patient with sickle cell anemia (Beutler, Disorders of Hemoglobm, 
; Ch 107 in Harrison's Principles of Internal Medicine, 14* ed. © 1998, herein 

incorporated by reference). The sickle cell Beta globin gene harbors a point mutate 
that substitutes a Val for Glu at position six of the polypeptide chain, resultmg m an 
abnormal hemoglobin which is prone to inappropriate polymerization. The methods of 
this invention can be used to correct the mutation. 
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Hematopoetic stem cells from a sickle cell patient are isolated, cultured, and 
expanded ex vivo as is known in the art (Brugger, Seminars in Hematology 37[1 Suppl 
2]- 42-49 2000- Dao et al., Blood 92: 4612-21, 1998; Aglietta et al., Haematologica 83: 
824-48 1998; Emerson, Blood 87: 3082-8, 1996). A 60-mer ssDNA oligonucleoude of 
SEQIDNO:12 (AACAGACACC ATGGTGCACC TGACTCCTGA 
GGAGAAGTCT GCCGTTACTG CCCTGTGGGG) is synthesized and partially 
purified by standard techniques (Pfleiderer et al., Acta Biochimica Polonica. 43:37-44, 
1996; Anderson et al., Applied Biochemistry & Biotechnology 54: 19-42, 1995, herein 
incorporated by reference). 

After culture and ex vivo expansion, about 10 6 hematopoetic stem cells are 
suspended in 0.4 mL PBS containing 0.1% glucose, about 10 purified lambda Beta 
pro tein, and about 1 ug ssDNA oligonucleotide of SEQ ID NO:12. The cell suspension 
is electroporated in a 1-mL cuvette at 280V and 250 uF with a Gene Pulser (Bio-Rad 
Laboratories Inc., Hercules, California, USA). Cells are then plated and cultured. 
Homologous recombinants harboring the mutation are identified and clonally isolated, 
further expanded ex vivo, and may be returned to the patient, or cultured for addmonal 
in vitro study. 

Those skilled in the art will recognize that a broad range of ssDNA and ssDNA 
binding polypeptide concentrations will be effective, as in Example 14. For example, 
hoth ssDNA and ssDNA binding protein may be present in concentrations ranging from 
0 001 uM to 100 mM; or from 0.1 mM to 1 pM; or from 1 uM to 100 uM. 
Oligonucleotide length can be varied in accordance with parameters presented m 
Example 12. There is no particular upper limit on oligonucleotide length. Inadchuon, 
two or more oligonucleotides can be included which have complementary 5> ends, 
thereby creating 3' overhangs which are effective substrates for ssDNA bindmg 
polypeptides such as lambda Bet, In addition, RecT, P22 Erf, Rad52, and other double 

strand break repair ssDNAbinding polypeptides may be substituted for lambdaBeta. 

Culture and electroporation conditions are readily variable without materially reducmg 
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homologous recombination. Moreover, nucleic acid may be introduced into the cell by 
any suitable macromolecular transfer method. 

Other types of stem cells may be used to correct the specific gene defects 
associated with cells derived from such stem cells. Such other stem cells mclude 
5 epithelial, liver, lung, muscle, endothelial, mesenchymal, neural and bone stem cells. 

Alternatively, certain disease states can be treated by modifying the genome of 
ceUsinawaythatdoesnotcorrectageneticdefectpersebutprovidesforthe 

supplementation of the gene product of a defective gene. For example, endothehal cells 
are preferred as targets for human gene therapy to treat disorders affecting factors 
10 nonnallypresentinthesystemiccirculafion. In model studiesusing both dogs ;andp igs 
endothelial cells have been shown to form primary cultures, to be transformable w*h 
• DNA in culture, and to be capable of expressing a transgene upon re-implantaton m 

wi ♦ ni Qripnrp 244- 1344, 1989; Nabel et 
arterial grafts into the host organism. Wdson et al., Science, 244. % , 

al Science, 244: 1342, 1989. Since endothelial cells form an integral part of the graft, 
15 such transformed cells can be used to produce proteins to be secreted mtothe 

circulatory system and thus serve as therapeutic agents in the treatment of genetic 
disorders affecting circulating factor, Examples of such diseases include insulin- 
deficient diabetes, alpha-l-antitrypsdn deficiency, and hemophilia. Epithelial cells, 
oocytes and hepatocytes are also useful cell types for therapeutic production of 
20 proteins. 

The method is also useful for knockout or modification of genes in embryonic 
stem (ES) cells. Such cells have been manipulated to introduce transgenes. ESceUsare 
ohtained from pre-implantation embryos cultured in vitro. Evans et al., Nature, 2 2 
154-156 1981; Bradley et al., Nature, 309, 255-258, 1984; Gossler, et al., Proc. Natl. 
25 Acad. Sci. U.S.A., 83, 9065-9069,- 1986; Robertson, et al., Nature, 322, 445-448, 198 ; 
US PatentNo. 5,464,764. Oligonucleotides designed to target specific gene segments 
in the ES cell are combined with lambda Beta protein or other ssDNA binding 
polypeptide and introduced into ES cells by electroporation or other transformation 
me thods. The oligonucleotides maybe designed as a series of overlapping segments 
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with 3' overhangs. Such transformed ES oells oan thereafter he combted wrth 
blastocysts ftont a non-hnntan annnal. The ES eeUs thereafter colonize the .embryo and 
oan con«ribu,e to the geror line of the resulting chimeric animal (faenisch, Scrence, 240, 

1468-1474, 1988). 

For example, sequences encoding positive selection marker neomycin 
resistance gene ace synthesis as a serica of overlapping 70-mer oligonucleotides, 20 
hase pairs of overlap and 3- overhangs. The V tormina! o.igonnCeottdes am desr^ 
to Jeri into tine second exon of tine mouse box 1.1 gene as deserihed tn U.S. Eaten, No. 
5 464 764. Because me overlapping oligonucleotides combine to encode a promoters 
M om'ycin reside gene, only those that snocessmlly incorporate into the targeted 
mouse box 1.1 second exon win express the neo gene product and have tire neomycm 
resistance phenotype. Tbe targeting is designed to provide the symbolic neomycrn 
resistance gene wrti, an operable promoter and translation alar, derived ftom me mouse 
hoxl 1 gene. The torgetingDNA is also designed soma, random incorporations 
elsewhere m tire ES cell genome are murkely <o be operably linked to any promos to 
allow transcription and translation. 

The series of overlapping oligonucleotides wiur V overhangs (about 200 
oanograma each, are combined with 10 uM lambda Beta pro,ein and introduced into ES 
ceus by eleetroporation using tire Promega Biotech X-Cell 2000. Rapidly gtowmgcells 
are trypsinized, washed in DMEM, counted *rd resuspended in buffer contanmrg 20 
* HEPES <PH 7.0), 137 mM NaCl, 5 mM KC1, 0.7 mM NaaHPO., 6 mM dextiose, 
and 0, mM .bCa-mercaptoeti^o,. Ins, pnor ,o electioporation, tire 
and lambda Beaprotein aze added to 10 7 ES cells in each 1 ml-cuve«e. CeUs and DNA 
are exposed to «wo sequential 625 V/cm pulses a, room temperatore, allowed to remam 
5 in tire buffer for 10 minutes, men plated in non-selective media onto feeder cells. 

Following iwo days of non-selective growth, tire cells are nypsinized and 
re p,a,ed on,o G418 (250 ug/ml) media. The positive-selection is applied for 
Because of the hi* efficiency of lambda Be<a-mediated recombination, the need for 
fWher selection (for example, negative selection by intioduemg a thymidine kmase 
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.eneand^ectagwiftgancidovir)^ b=obvia,ed. Appropriately transformed, 
Lle-resistan. cells are grown in non-se,ective medU for 2-5 days prior «o injection 
into blaslocys* aeeording to .he method of Bradley in Teratooaroinonras and .nrbryon,e 
stem eells, a practical approach, edited by E. J. Robertson, Press, Oxford (1987), p. 
5 125. 

Blastocysts containing the targeted ES cells are implanted into pseudo-pregnant 
femaies and ailowed to develop to term. Chimeric offspring are identified by coat-color 
m a*.rs and those males showing cmmerism were selected for breeding offsprtng 
Those offspring which carry the mntan, aMe oaar be identified by coa, color, an 0. 
W presence of the mutant allele reaffirmed by DNA ^ysis by tail-blot, DNA analyses. 

Thus the method markedly simplifies the construction of gene knockouts and 
gene modifications in ES cells. In addition to its possible relevance to plant, annual 
and human gene therapy, the method will simptify the construction of tiartsgemo 
animals harboring either gene knockouts or gene modifioanons. 

, 5 As described for stem cells and in Example 14, a broad rauge ofssDNA and 

ssDNA binding polypeptide concentrations wil. be effective. For examp.e, bod, ssDNA 
aad ssDNA binding proleiu may be present in amounls ranging fiom 0.00! pM to 100 
nM; or torn 0.1 pM ,o 1 pM; or fiom 1 uM to 100 pM. Oligonucleotide lengua can be 
varied m accordance with parameters presented in Examp.e 12. Tbere is no particular 

J0 upperlinutonotigonucleotidelengm. In addition, two or more oligonucleotides can be 
iluded which have complement T ends (for examp,e wim 10, 20, 30, 40 bp 

. , 1 +wv nvirrVianffs which are effective substrates 
complementary 5' ends), thereby creating 3 overhangs wmcn 

for ssDNA binding polypeptides such as lambda Bet, In addition, RecT, P22 Erf, 
Rad52 and other double strand break repair ssDNA binding polypeptides may be 
25 substimtedforlambdaBeta^thesamerangesdescribedforlambdaBet, Those 

skilled in the art will recognize that culture and electroporation conditions are readdy 
variable without materially reducing homologous recombination. Moreover, nuclei 
acid may be introduced into the cell by any suitable macromolecular transfer method. 
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EXAMPLE 16 
Homologous Recombination in Plants 

The methods disclosed herein are also applicable to the manipulation of plant 
cells and ultimately the genome of the entire plant. A wide variety of transgenic plants 
have been reported, including herbaceous dicots, woody dicots and monocots. For a 
summary, see Gasser, et al. (1989), Science, 244, 1293-1299. A number of different 
gene transfer techniques have been developed for producing such transgenic plants and 
transformed plant cells. One technique used Agrobacterium tumefaciens as a gene 
transfer system. Rogers, et al. (1986), Methods Enzymol., 118, 627-640. A closely 
related transformation utilizes the bacterium Agrobacterium rhizogenes. In each of 
these systems a Ti or Ri plant transformation vector can be constructed contammg 
border regions which define the DNA sequence to be inserted into the plant genome. 
These systems previously have been used to randomly integrate exogenous DNA to 
15 plant genomes. 

Preferably, DNA designed for homologous recombination with a target DNA 
sequence inplants are combined with lambda Beta protein or other ssDNAprotein and 
directly transferred to plant protoplasts by way of methods analogous to that prevxously 
used to introduce transgenes into protoplast, Concentration of the DNA and ssDNA 
binding proteins are as described in Example 15. See, e.g. Paszkowski, et al. (1984), 
EMBO J 3, 2717-2722; Hain, et al. (1985), Mol. Gen. Genet., 199, 161-168; Smlhto, 
et al (1985), Biotechnology, 3, 1099-1 103; andNegrutiu, et al. (1987), Plant Mol. 
Bio 8 363-373. Alternatively, the PNS vector is contained within a liposome wmch 
maybe fused to a plant protoplast (see, e.g. Deshayes, et al. (1985), EMBO J., 4, 2731- 
2738) or is directly inserted to plant protoplast by way of intranuclear micromjecton 
(see, e.g. Crossway. et al. (1986), Mol. Gen Genet., 202, 179-185, and Reich, et al. 
(1986), Bio/Technology, 4, 1001-1004). Microinjection is the preferred method for 
transfecting protoplasts. The DNA and ssDNA binding proteins may also be 
microinjected into meristematic inflorenscences. De la Pena et al. (1987), Nature, 325, 
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274-276. Finally, tissue explants can be transfected by way of a high velocity 
m croprojectile coated with the DNA and ssDNA binding proteins analogous to the 
me thods used for insertion of transgenes. See, e.g. Vasil (1988), Biotechnology 6, 
397- Klein, et al. (1987), Nature, 327, 70; Klein, et al. (1988), Proc. Natl. Acad. Sea. 
USA 85 8502; McCabe,etal. (1988), Bio/Technology, 6, 923; and Klem.etal., 
Genetic Engineering, Vol 1 1, J- K. Setlow editor (Academic Press, N.Y 1989). Such 
transformed explants can be used to regenerate for example various senal crop, Vasu 
(1988), Bio/Technology, 6, 397. 

Once to DNA and ssDNA binding protein have been inserted into to plant cell 
by m y of to foregoing methods. Wagons recombination targets to ohgonucleot.de 
,o to appropriate site in to plant genome. As in previous examples, to 
obgonucleotide may be a series of overling ssDNAs whh 5' or 3' overhangs. 
Dependmgupontometodologyusedto transfer selection is perfotmed on hssue 
cultures of to uansfonned protoplast or plan, eeU. hr some instances, cells amenable to 
to culture may be excised from a transformed plant either 60m to FO or a 
subsequent generation. 

The amino acid composition of various storage proteins in wheat and com, for 
example, which are lotown to be deficient in lysine and tryptophan may also be 
m „dined. FNS vectors can be readily designed .0 alter specific codons wrUnn such 
, storage proteins ,0 encode lyshte and*, tryptophan thereby increasing .be 

vahte of such crop, For example, to zein protein in com (Pederson 1 al. (1982) Cell, 
29, 1 01 5) may be modified to have a higher content of lysine and tryptophan by to 

vectors and methods of the invention. 
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EXAMPLE 17 

Creation Of Improved Bacterial Host Strains for Lambda-Mediated 

Recombination in BACs 

5 Transfer of lambda recombination genes to DH1 OB cells. 

To facilitate the use of lambda-mediated recombination with BACs, an 
improved phage-mediated recombination system has been created for efficient 
recombination nsing BACs. The DH10B strain unlike most other strains of* coh is 
efficiently transformed with BAC DNA and contains many of the BAC genomic 

10 libraries;* was judged to be a good host strain for subsequent modification. 

Because DH10B is recA defective, standard genetic crosses cannot be used to 
place the defective lambda prophage used for lambda-mediated recombination into the 
DH10B strain of* coli. To circumvent this problem, DH10B was first converted to 
recA + , then the lambda recombination genes were crossed in and the strain was again 
1 5 made recA- but now carrying the lambda genes. 

To make DH10B recA+ , a lambda transducing phage carrying the recA+ gene 
was used to lysogenize DH10B creating the derivative DY303. In strain DY330 used 
for X mediated recombination, the tet gene conferring tetracycline resistance was 
inserted by homologous recombination where the cro-bioA deletion exists creating 
20 strain DY363 .API lysate made on DY363 was used to infect DY303 and by standard 
bacterial genetics the tet gene in the cro-bioA deletion was crossed into DY303. This 
deletes a large segment of the X DNA of the lysogen including the recA + gene. This 
new derivative of DH10B and DY303 is named DY380, is RecA- and carries the tet 
selectable marker substituted for the cro-bioA segment.. It was observed that DY380 
25 cells were transformed with BAC DNA at efficiencies of 10" 6 to 10" 4 . 
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Creation ofDY380 derivatives containing arabinose-inducible ere orflpe genes 

BAC targeting often makes use of a selectable marker to introduce the targeting 
cassette into the targeted locus. The selectable marker can, however, interfere with the 
subsequent function of the targeted locus. To circumvent this problem, the inventors 
5 noted that a selectable marker flanked with either FRT or loxp sites can be removed by 
either Flp or Cre recombinases, respectively. Thus, the inventors have created two new 
strains, EL250 and EL350, by ultimately replacing the tet gene in DY380 with araC and 
placing fhe/jpe and cre genes under an arabinose-inducible promoter. The genotypes of 
DY380, EL250, and EL350 are shown schematically in FIG. 6. Although the 
10 arabinose-inducible promoter P bad was used in this example, essentially any inducible 
promoter may be used to activate flpe and cre expression. 

■ In DY380's prophage, tet is located between cI857 and bioA. In EL250's 
prophage,^ replaces tet (flpe is a genetically engineered^ that has ahigher 
recombination efficiency than the original/!/, gene; Buchholz et al, Nature 
15 Biotechnology 16: 657-662, 1998). Thus, as illustrated in FIG. 6, both EL250 and 

EL350 have heat-inducible homologous recombination (the X red genes) and arabinose- 
inducible site-specific recombination (flpe or cre) functions. This dual regulation 
allows both selective targeting by recombination as well as the subsequent removal of 
the selection marker from the targeted locus by site-specific recombination. 
20 Improved approach for introducing defective lambda prophages into bacteria: mini 
lambda circles 

A method has been developed for introducing the X-mediated recombination 
system directly into nearly any E. coli strain including recA defective DH10B 
derivatives. These derivatives may carry BACs, PACs, or other vectors. 
25 The DY330 strain carries deletion of prophage genes from cro through bioA. 

This deleted segment of X and bioA were replaced to create a derivative that contains a 
folly normal XcI857 single-copy lysogen. Lysogens of this type can be induced at 42° 
to express X functions including the Red recombination functions. Because the X 
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^es aU of «he repl.ca.ion and aytic genea, induction for longer .bar, 6 n,^~ 

^4 nainu.es, reconabhaauon hancuons arc on ly par,* — « . Us 
when retted to grow a, 32°C. Using, for example, a 4-nnnu.e time of 
induction, X-naediared reconab,nan<s ean be generated berween Uneart 
DNA and ,be chronaosona. including the DNA of the prophage. Tons, phage lambda 
ZZ bensed.l^eandgener.ereao.bhaan.sn.BACsar^. However, 

recombination efficiency wouid be low hecanse of the shot induction tanae. 

PCR eaasettes confining 5 genea for differen. drag resistance markers were 
^pMed [cat, ta, -» M a P « ™* to """ 0 ^ " " * 

r Jg or reairtan. reoonabhaan. a, 3TC A conngnons prophage ^ 
LblposaHonSS^ofrhe^apinc^obaaeposinon^^of^e naapan 

^arerop.aeadbydaedrngcaasettes.tseaConatandOppenheina). Thasddetion 

aucbnaen. aires a*, and u«* a, nae termini of one prophage whereaa DY330 has 
teough bioA deleted as part of the cro-to^ deletton. 

This se. of stiains (wi«h respective dnag oasset.es) oan be hadnced for longer 
, dntes rtaan Ore contp.ete lambda wrthou. «ng ft. ce.,s utereby providing n,»nnal 
boanologons recombination activity jus, as warn DY330. The pa operon of these 
prophageaandndenaea-rand^gene, Induction activates their expressaon and 
propndgcs m m ^ r ;fi r excision of the prophage as a 

because bona a«L and a«R are present causes arte spectfic excasaon J J 

DNA circle carrying Us associated drug naarxer. CeUs nndergonag na uch™ foH5 mm 
, may ,„se rhe orignaa, prophage. This happens ha abon, 50% of the " 

Jbavenaeprophage. The 50% wrth, he prophage are Uxe.yto occur by renrteg.r.on 
ofUaeoiroularDNAatnaewca.edannstenaroughha.-media.etisi.eapec.nc 

recombination. 



WO 02/14495 



PCTAJSO 1/25507 



76 



in 
si 



10 



n e defective prophage DNA can be isolated and punned from 
if after a 15 minute induction, eells „ lysed andDNA is isolated by plasnud 
it alter aum Hrcular phage DNA with its drug 

Ikers can be purified. This DNA cannot replicate upon rejection into * «* 
markers can v fimrtion to allow integration of the 

strains but it ean express its pL operon and fotfunction to all S 
circular DNA by site specific recombination between attP in the circular ^ 
circular ui y functions are required for 

in the bacterial chromosome. Only Int and the host mx 

v .tfen Such integrated DNAs are stable, are immune and canbe 
itespecific recombmation. bucn inregr«ne 

selected by the drug marker each carries. 

Becaus e KecA is no. quired for site speciftc 
DH10B derivatives can be used fbr ft—tion and for uftegrafton of tiae oucular 
defective phage DNA selecting for its appropriate drug marten 

t 

The defective nuni-prophage can also be induced as part of a di-lysogen in 
1 a ,7*57 nhase is also present. The phage lysate createdby this 9U 
15 which a complete A, cI857 phage is als p lambda phage particles as well 

min induction at 42 degrees in L-Broth generates normal lambda ph g p 
as particles that contain die defective mini-prophage DNA (In X terming « 

lis. infection of these lysates into cells (e.g. DH10B) allows DNA m3ection of 
particles). Infection on prombination ^d selection for the drug marker 

the mini-prophage DNA, site specific recombmation, an 

20 carried on that DNA. 

EXAMPLE 18 

An improved Strategy and Improved Reagents for BAC Engineering 

^ v .t™1p 17 inBAC engineering, the efficiency of 
To test the prophage system of Example 17 in ^ e ^, er unents described in this 
BAC recombination in EL250 cells was investigated. In the expenm _ 

1 actable cassette was targeted to a mouse neuron-specific locus ma 250 kb 
25 example, a selectable cassette was g subseqU ent mouse 

BAC The BAC was then further modified to enhance its useru 
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genetic studies. These expenmenis validated a, improved strategy and provided improved 
reagents for BAC engineering nsing the lamhda recombination system. 

The Eno2 gene is located in the middle of 284H12, a fully sequenced BAC 
(obtained fiom Research Genetics; Ansari-Uri etal., Genome Research 8: 2940, 1998). 
The Eno2 gene was targeted beoanse ,, is neura>-speci fi c and exposed in ™- 
neurons (Marangos and Sehmechel, Annnal Review of Neurosctence lOt 269-295, 1987). 
By knolg on, eno2 and replacing 1, with a era-containing cassette a BAC ^genr^ 
Une that expresses Cre in a« mahare nenrons was created (described m » 
BAC transgenic line is usefttl for subsequent condhiona, knockout stitdtes. The tnventom 

tj a rc cr P laree enough to contain all the important 
used a BAC approach in part because BACs are large enougn 

regulatory sequences required for proper regulation of gene expression. 

The following describes the construction of the BAC transgenic line with neuronal- 
specific Cre expression. 

Generation of the targeting cassette and BAC-contammg EL2SO cells 

The OBS-eGFPcre-FRT-kan-m^t cassette was PCR amplified ftom 
PICGN21, which was — tedby subcloning a 1.9 kbp «««*- - . 
fiued-in mT-tan-FRT^ fiom pFRTneo tnto the ^I/BcMgeated and fil ed-tn 
cloning site of pKESeGC. The HMV «#HW«r — - 
^tag chimeric 63 n« primer, The 3' 21 n. of each primer was homologous to the 
20 targeting cassette used for amplification while the V 42 nt was homologous to the las, 
cxlof^whe.mecassettewaa.obe.rge.edhyrecombtna.ion. The pnnrers were 

designed to precisely targe, me cassette downstteam of the Eno2 stop codon and upstream 
of its polyAsite. 

The f^-comaming 284H12 BAC was electtoporated into EL250 cells and six 
25 chloramphentco, resist. (Cm") colonies seiected. Digestion of BAC DNA from » Cm 
Monies wim EcoXl or AMD. showed ma, one had an abnormal digesdon pattern 
However, in other BAC elentroporation experiments involving me analysis of more <h=n 
76 additional colonies, no abnormal BACs were identified. These resuhs mdrca,= ft* 
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BAC rearrangements during elaboration are rare. Subsequent experiments were 
carried out with Cm R -resistant EL250 colonies harboring BACs having proper EcoW 
and/or HindW. digestion patterns. 

Generating and isolating a BAC with a disrupted eno2 locus 
5 Next the 284H12 BAC was modified to disrupt the eno2 locus with the IRES- 

eGFPcre-FRT-kan-FRT targeting cassette. The methods used in these experiments were 
similar to those described extensively herein, and in Yu et ah (Proc. Natl. Acad. Sci. 
U.S.A. 97: 5978-5983, 2000). The approach is illustrated schematically in FIG. 7. 

EL250 cells carrying the 284H12 BAC were shifted to 42°C for 15 min. to induce 
10 lambdaExo,Beta,andGamexpression. The cells were then electroporated with 300 ng of 
the amplified IRES-eGFPcre-FRT-kan-FRT cassette, and kanamycin-resistant (Km ) 
colonies were selected. A kanamycin-resistant phenotype indicated that the targeting 
cassette was successfully integrated into the 284H12 BAC (illustrated as "Targeting," FIG. 
7 middle). Approximately 5200 Km R colonies were obtained from 10* electroporated 
15 cells for a targeting efficiency of about 10" 5 . No colonies were obtained from control cells 
that were not heat-induced. Thus, lambda recombinase expression was required for 
efficient recombination. 

Twenty-four kanamycin resistant colonies were analyzed with whole-cell PCR 
using primers that flanked the targeted locus. The PCR results indicated that all were 
20 correctly targeted. Sequencing of the targeted region from six colonies, however, showed 
that three carried point mutations. To determine whether these point mutations were 
introduced during PCR amplification or during homologous recombination, the targeting 
was repeated. This time, however, the PCR-amplified IRES-eGFPcre-FRT-kan-FRT 
cassette was subcloned into the M site of pBluescript by blunt-end ligation before 
25 targeting, and plasmids carrying wild type amplified cassettes identified by DNA 

sequencing. These cassettes were then released from the plasmid by BamSL digestion and 
used for targeting. Using this two-step method, all twelve targeted BACs that were 
subsequently sequenced contained wild type IRES-eGFPcre-FRT-kan-FRT cassettes. 
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These results indreate *a« the point mutations were introduced by the pnmers used or 
during PCR amplification of the targeting cassette rather man during targetmg. 

Removing the kanamycin-resistance marker 

Next the kan seleotabie marker was removed fiom me BAC to prevent it torn 
posstbiy interfering with Cre expression. This process was initiated by arahinos. 
Lment, which indnoes EL250 ceUs . express the Hp. recombmase. The process 
illustrated in Fig. 7, bottom line CFlip-out of to"). 

Overnight cuUwes fiom single ** colonies were diluted 50-fold ,n LB medium 
md ^own tili OD s „„ - 0.5. F!pe expression fiom the EL250 cells was fe. mduced hy 
, fa cuhatingmeou 1 nne S w i m0. 1 %L-amhinose,or 1 hr.Thehao.eria 1 ce 1 lswere 

UphL. Platea (.2, ugtmf). The nex, day, .00 Cm*colonies were ptcxed and 
related on to plates (25 ug/ml) to test for loss of kanamyom reststanoe. 
C^hom J) — htdicates tha, me cell retained me BAC, whereas hanamyom 
, " indioatesma.hanhasheensnooess.tHyremove.fiommeBAC. Allcolomes 

were Km' and contained a single FRT site at the targeted loons. 

Without hemg hound hy menry, it is EX* that the surprisingly high recombination 

effi ciency refieots the tight control of Flpe expression afforded hy tite shrg^opyP^ 
p.omo.erandflpegene.andtitefaotmatthe^si.esareiooatedme^amerthanm 

20 trans to each other. 

Removing an undesirable toP aire in ,he BAC vector backbone 

AtoxPsimcontaM^mtheBACv^haeH^feBcloBACn^mzuy.^ 

Pt „c. Natl. Aead. Soi. U.S. A. S9: 8794-8797, 1992) was removed hy a final round of gene 
targeting. 

25 To H* the remove, of this undeshable loxP site, a new plasmid, pTamp, was 

locaKd 5' of the to xP site and 370 bp of pBeloBACl 1 sequence )ooate4 3 of tite toxP 
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This amp insert can be released from pTamp by toffl digestion and used to replace the 
loxP site in the BAC transgene by gene targeting. This targeting reaction is very efficient 
due to the large amount of homology between the amp cassette and the pBeloBACl 1 
vector (56,200 colonies per 10 8 electroporated cells). 

Upon removal of the undesirable loxp site, the modified 284H12 BAC was used in 
the transgenic mouse studies described in Example 20. 

EXAMPLE 19 
Subcloning by GAP Repair 

This ^-mediated recombination system can also be used to subclone fragments from BACs 
without the use of restriction enzymes or DNA ligases. This form of subcloning rehes on 
gap repair to recombine the free ends of a linear plasmid vector with homologous 
sequences carried on the BAC (FIG. 8). The method is readily adaptable to other forms of 
intramolecular and extrachromosomal DNA, such as plasmids, yeast artificial 
chromosomes, PI artificial chromosomes, and cosmids. This novel method combmes 
lambda mediated recombination with gap repair to enable recombination of very large 
DNA segments onto an extrachromosomal vector. 

The linear plasmid vector with an amp selectable marker and an origin of 
replication carries the recombinogenic ends (FIG 8B). The vector is generated by PGR 
amplification using two chimeric primer, The 5' 45-52 nt of each primer is homologous 
to the two ends of the BAC sequence to be subcloned while the V 20 nt is homologous to 
plasmid DNA. Recombination generates a circular plasmid in which the DNA insert was 
retrieved from the BAC DNA via gap repair. Circular plasmids are selected by then 
Amp R . 

To determine the maximum sized fragment that can be subcloned from BACs using 
this method, several different pairs of primers were generated in which the homology 
segments were located 25 kb, 48 kb, 60 kb, or 80 kb apart in the End BAC DNA (Fig. 
8A) Rare cutter NotI and Sail restriction sites were also incorporated into these primers 
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so that the subcloned fragments could be released from the recombinant clones intact. 
Using pBluescript as the cloning vector, it was possible to subclone the 25 kb fragment. 
However, attempts to subclone larger fragments were unsuccessful. As a possible 
explanation for this result, it was hypothesized that subclones containing larger fragments 
5 on a high copy vector such as pBluescript were toxic to the cell. 

To determine if the hypothesis was correct, a lower copy number vector (pBR322, 
with its copy number control element intact) was used as the cloning vector. Fragments as 
large as 80 kb could be subcloned with a P BR322 vector. Not all subclones obtained by 
gap repair had the correct inserts (as determined by restriction enzyme pattern analysis). 
L0 Some subclones lacked inserts while others contained inserts with aberrant restriction 
patterns In order to confirm that the correct insert has been subcloned, when using 
subcloning by gap repair, a method of screening subclones can be used to assure that the 
selected subcloned contains the desired insert. Such methods include restriction mapping, 
sequencing, PCR analysis, Southern analysis, etc., and other methods well known to those 
15 of skill in the art. 

The ability to subclone large fragments of genomic DNA by gap repair should 
facilitate many studies in genome research that were difficult or impossible to perform 
previously. For example, Gap repair for cloning on to vectors can be used with many 
different vectors used for protein expression in bacteria, plants and animal cells; 
20 mutagenesis; cloning; transcription; etc. Targeting vectors or transgenic constructs can be 
subcloned with ease, and virtually any region of the engineered BAG can now be mcluded 
in the desired subclone. 

Lambda mediated recombination combined with gap repair makes it possible to 
subclone fragments from complex mixtures without first purifying the DNA to be 
25 subcloned. TTris greatly facilitates the subcloning process and allows for high throughput 
subcloning of tens of thousands of genes or DNA molecules into many different vector 
backbones. This will greatly facilitate studies designed to determine the function of genes 
uncovered in large scale sequencing projects. For example, cDNA clones for genes of 
unknown function can be subcloned into many different expression vectors and the 
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tactic* of these genes snrdres in ceU-based assays in vino or in the whole animai. Tins 
W. of subctadng does no. rely on PCR amplification, which can introduce unwanted 
mutations into the subcloned sequences. 

Subclone by gap repair also facilitates tire identification of locus control regions 

* + w m ™ Kp located at some distance from the gene. Many 
5 or other regulatory elements that may be located ax som 

such potential elements are presently being identified by techniques such as compare 
genome sequencing. Examples include pathogemcity islands, replicative origins and 
segregation elements. The ability to modify precisely these regulatory sequences on 
BACs, combined with the ability to include or exclude them during the subcloning 
10 process, will make it possible to dissect the function of these sequences in the whole 
animal or in vitro at a level not previously possible. 

EXAMPLE 20 
Production of Transgenic Mice Using BACs 

Examples 17 and 18 describe the construction of a modified BAG believed to 
15 containallofthereg^latorysequencesneededforneural-specific^ 

transgenic mice. To investigate this hypothesis, the modified BAG described m Example 
18 was injected into (CSH^eN-Mtv" X C57BL/6Ncr) F 2 zygotes. A BAC transgenic hue 
carrying approximately two copies of the transgene was then established. 

In addition to the BAC transgenic line, two transgenic lines carrying 25-kbp 
subclones of the BAC were also established. The 25-kbp subclones-contains the entire 
modified Eno2 coding region as well as 10 kbp of 5' and 5 kbp of 3' flanking sequences, 
respectively. One transgenic line, 25kbp-l carries approximately four copies of the 
transgene, while the second, 25kbp-2 carries approximately five copies of the transgene. 

• • tuRAr transgenic line could be compared to Cre expression in 
Thus, Cre expression in the BAL transgenic iwc 

25 the transgenic lines carrying the subclone. 

The transgenic mice were crossed to ROSA26 reporter mice, which contain a lacZ 

reporter that canbe activatedby Crerecombinase (Soriano, Nature Genetics 221:70-71, 
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im . Doub.e heteroses were subaetpently anaryzed by X-gal staining * 4 weeks of 
* Severa, different tissue* were examined to, X-gal expression including .he brain 

were found only in neural tissue in the three transgenrc tares, mdtcann. th 
ana the 25 «p suhclone conuin the rectory element needed for neuml-specmc 
^Lion IpartemofCreacdvity was, however, different in therbree hue,. 
« seottoL of the brad, from dae BAC trance mice showed blue— 
annughou, the gray mauerbu, not in me white matter, indioative of Cre ae ^mmost 
^bu.no.ingua.cells. mconhast.X.g.s^nginthoaS.thp-l and ^ 
transgenic tniee was present h, only a suhset of neurons and expreaston was vanab.e 

between the two different lines. 

HigherpowermagnificadonoffheoerebeUtunoffheBACtransgent 

« Cre J. expressed in vb-y all neuronal cells. ThUMudedF^ece^md* 

, fth» Am cerebellar nuclei. Li contrast, in me 

cells in the molecular layer and neurons of the deep cerebellar 

cens uiuic ui x^fnnioi cells in addition to a few cells in 

25kbp-l line, Cre was expressed in only a subset of Golgi cells in 
&e ^ule and Purlcinje cell layer, Glial cells of white matter also express d £ 

• t *. 9^Vhn 7 line Cre expression was limited to the gray 

— — * to 2 * ^ Lulg most basae, oeUs, stedate 

matterandtocludedavarietyofneuronaloeUtypes,inotu<nng 

Lpntxinjeoensandnenronsofdredeepcerebe^nuolei. 
cells and Golgi oeUs in the granule layer expressed Cre. 

Hi^er power magnifrcadon of the hippooanrpns and onrtex ahowed sunita result, 

s (CA) region and the dentate gyrus (DG) expressed Cre. The same waa trae m the =or^ 
w^lixUyersofdtecortexthatconUinedneuronsOayeran-VTlexpressedCre. In 

where all sue layers «nicmice ahowed reduced Cre expression tn 

contrast, the hippocampus of 2*bp-l transgenic mr 
*. DO (Fig. 4E) and layers H and ffl of cone, The *xbp-2 — c 
even lower .evels of Cre expression in the DG. The CM and CA2 regtons of ttte 
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failed to express Cre. Cre expression was also greatly reduced in the cortex, with layers II 
and IH showing most the reduction. 

Cre activity in the spinal cord, dorsal root ganglion (DRG) and retina of the 
transgenic mice was also examined in order to determine whether Cre was expressed m 
mature neurons within the peripheral nervous system. Similar to what was observed for 
the central nervous system, Cre was expressed in most mature peripheral neurons m the 
BAC transgenic mice while fewer peripheral neurons expressed Cre in the two 25kbp 
transgenic lines (data not shown). 

To determine whether Cre was expressed in all Eno2 protein-positive neurons, a 
section from the brain of a BAC transgenic animal was inununostained with an anu-Eno2 
antibody followed by X-gal staining for Cre activity. Virtually all Eno2-po S itive neurons 
were active for Cre. Thus, Cre expression in BAC transgenic animals correlated tightly 
with native mouse Eno2 promoter-enhancer activity. 

The present application, particularly Examples 17-20, describes a highly efficient 
r ecombinationsystemformani P ulatingBACDNAin£. coll The recombination system 
uses a defective X prophage to supply functions that protect and recombine the 
electroporated linear DNA targeting cassette with the BAC sequence. Because the 
recombination functions are expressed from a defective prophage rather that a plasmrd, the 
recombination functions are not lost during cell growth as often happens with plasmrd- 
based systems. Another advantage of this prophage system is that the X gam a*d red 
recombination genes are under the control of the temperature sensitive X repressor that 
provides a much tighter control of gam and red expression than can be obtamed on 
plasmids. This tight regulation, combined with the strong X PL promoter, winch drives 
gam and red expression to very high levels, makes it possible to achieve recombmatron 
frequencies that are suprisingly efficienct (at least 50-100 fold higher than those obtamed 
with plasmid-based systems; Narayanan, et al., Gene Therapy 6: 446: 442-447,1999; 
Muyersetal., Nucleic Acids Research 27: 1555-1557, 1999). The tight control prevents 
expression of any recombination functions except for the 15min temperature induction. 
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The ability to precisely manipulate large fragments of genomic DNA, independent 
of the location of appropriate restriction enzyme sites, has many applications for functional 
genomics, both in the mouse and in other organisms. As shown herein, Cre can be 
introduced into the coding regions of genes carried on BACs facilitating the generation of 
5 Cre-expressing transgenic lines for use in conditional knockout studies or for use m 

conditional gene expression studies. Genes can also be epitope tagged and microinjected 
into the germlineofmice carrying amutation in the gene. Ifthe epitope tagged transgene 
' rescues the mutant phenotype, the epitope tagged protein is functional and the epitope tag 
can serve as a marker for expression of the gene. Likewise, a gene carried on a BAG can 
10 be replaced with another gene and the function of the "knock-in" mutation assayed m 
transgenic mice. 

This recombination system also facilitates the generation of complicated 
conditional targeting vectors. While the generation of such vectors often used to take 
several months it can now be done in a only few weeks time. The ability to reversmly 
15 express Cre or Flpe recombinases in E. coli speeds this process even further. Moreover, as 
demonstrated in Example 1 8, a selectable marker flanked with lax? or FRT sites can be 
now be introduced into an intron of a gene and then removed by transient Cre or Flpe 
expression leaving behind a solo loxP or FRT site in the intron. 

EXAMPLE 21 

BAC Recombination without Drug Selection 

The high efficiency of recombination described in Example 18 and elsewhere in 
these examples suggested that it might be possible to do targeting without drug selection. 
Direct targeting without drug selection would offer a number of significant advantages. In 
particular, it would facilitate genomic experiments in which the presence of a selectable 
25 marker, or even a FRT or lox? , scar might be undesirable. 

To demonstrate that targeting can be achieved without drug selection, a 24 bp 
FLAG tag was targeted to the 5' end of the SRY-box containing gene 4 (Sox4 ) gene 
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■„ ,l25kbBAC Fortheseexpenments^lHbptargetingcassettewas 

03111 ! !! 45 bp arms homologous to the So* gene flanked the 24-b P FLAG 

generated m which two 45-bp arms horn oligollU cleotides 

rr.. n7sJ a fracmient was created by syntnesi/,ui to iw r 
„ Thr DNAft ^^rtapsw.re^dandfilledtaby 

that overlapped at their 3 enas oy u F 



Taq polymerase. 

u- flflp OP nes from the defective prophage was heat- 
Fynression of lambda recombmase genes trom mc u 

Expressions . FL AG-tagged cassette was 

induced in DY380 cells carrying the ^ ^ ^ spread on LB plates to a 

introduced into the cells by electroporation. The cell w 

densityof~2^^ 

.entifledbycolonyhybridi^ 

bp FLAG tag and 3 bp on each side that was homolog 

„ere identic. ^co^^FLAG^dveco^w^^m^O 

^ — &om - r r — ^ 

amplification and direct sequencing showed that each 
was correctly targeted. 

As unopnvocauy denronsbafcd in this example, to surprising., high 

« • v offered bv this recombination system makes ,t posable to 
meombinanon effleteney offered by fht ^ 

m ani P ula,e BAC or other DNA wrmou, dm g a iecbon. ^ 

insertions can now be engineered ^^^^^^ In caseswhere the gene is mutated in 
«ed dmg seiecrion marker or a « . m~ &e ^ md ^ 

human disease, the exact disease-caustngmutatrons can gm 
effect of these mutations analyzed in transgenic mice. 

EXAMPLE 22 
Materials and Methods Used in Examples 17-21 

a „vr-pnt DH10B were maintained at 32°C 
Bacterial strains. All of the strains used except DHlUts 
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of the temperature inducible prophage. DY303 was 



constructed by infecting 
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DH10B cells (Gibco) with a X phage carrying recA (M857 recA+) (a gift from F. W. 
Stahl) and lysogens were selected. Strain ELI 1 was constructed by replacing 
of DY380 with a cassette containing the cat and sacB genes by selecting CmR. ELI 1 cells 
are Tet s Cm* and sensitive to 2% sucrose. Strain EL250 was constructed by replacing the 
cat-sacB cassette of ELI 1 cells with araC and the arabinose promoter-driven^ 
recombinase gene Qfrrfpe) selecting in the presence of sucrose. EL250 cells are 
resistant to 2% sucrose. Strain EL350 was constructed in a similar manner except for ere 

instead of flpe. 

Constructionofplasmid.^mS-eGFPcre^ 
was PCR amplified from P ICGN21, which was constructed by subclomng a 1.9 kbp 
Hfndm/^I-digested and filled-in FRT-kan-FRT fragment from pFRTneo mto the 
iV^c/I-digested and filled-in cloning site of pIRESeGC. The FRTneo was constructed 
by ampHfying the kan gene along with the Beta lactamase promoter from pEGFP-Cl 
(Clontech) with primers 

5'CTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTnTCGTCAGGTGGC 
ACTTTCGGG (SEQ ID NO-13) and 5'CTCAGAAGAACTCGTCAAGAAGG (SEQ ID 
NO- 14) The amplified fragment was then targeted between the FRT ate in P NeoB-g<d 
(Sttatagcne). The pKESeGC was generated by inserting the 2 ftp HUM**** 
and filled-in eGPPcre fiagmen, ftom pEGC into the 3.5 kbp ^-digested and fiBed-m 
doning site of pNTRlacZPCKneotaP (Atango m at., 1999). The pEGC was generated by 
Cloning a 1.05 ftp EeoRVKpnl PCR fragment containing the ere gene fiom pGKmnere 
(a gift fiom P. Soriano) into fire EcoWKpnl site of pEGFP-Cl . This PCR fiagmen. was 
generated by amplifying the ere gene ftom pGKmnere with primers 



5 

25 5 



enerated by amplifying the ere gene nom r— 

GTAGGTACCTCGAGAATCGCCATCTTCCAGCAGGC (SEQ ff>NO:15) and 

JTCGAATTTTCTGCATCCAATTTACTGACCGTACACC (SEQ ID NO:16), whieb 
contain EeoK and Xj.nl cleavage sites, respectively, at theft 5' ends. 

To constmc. the pTamp vector, the ^-targeted pBeloBACl 1 was first generated 
hy replacing the taP site in pBeloBACl 1 (Shiznya * at, 1992) with the PCR ampltfieft 
amp gene from pEGFP (Clontech). The primers nsed for amplificatton are 
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y GC AAGTGTGTCGCTGTCGACGAGCTCGCGAGCTCGGACATGAGGTTGTCTIAGA 
CGTCAGGTGGCAC (SEQ ID NO:17) and 

yCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGAACCYCkCGl 
TAAGGGATTTTGGTC (SEQ ID NO: 18), which are homologous to the amp gene of 
5 pEGFP (in plain) and to sequences flanking the loxP site in pBeloBACl 1 (in italic). A 
2.4-kbp PCR fragment amplified from amp-targeted pBeloBacll with primers 
5'GCAGGATCCAGTTTGCTCCTGGAGCGACA (SEQ ID NO:19) and 
5'TGCAGGTCGACTCTAGAGGATC (SEQ ID NO.20) was then cloned into the 
XhoVXbal and filled-in site of pCS (Stratagene) to create the pTamp vector. The 2.4 kbp 
10 amp cassette containing an amp gene along with 920 bp of 5', and 370 bp of 3', 

pBelOBACl 1 vector sequence flanking the loxP site can be released by BamRl digestion 
and used directly to replace the loxP site in any pBeloBACl 1 -derived BACs with amp. 

The pK04 vector containing the cat-sacB targeting cassette is a derivative of 
pK03 (Link et al, 1997) in which 605 bp had been deleted between cat and sacB. 
15 The anzC Wp* targeting cassette was amplified from pBADflpe, which was 

constructed by subcloning a 1 .4 kb PstVKpnl fragment from pOGFlpe (Buchholz et al., 
1998) into pBAD/MycHis-A (Invitrogen). The araC-V^cre targeting cassette was 
amplified from pB ADcre, which was constructed by introducing a 1 .2 kb HindN/Ncol 
fragment from pGKmncre into pBAD/His-C (Invitrogene). 
20 Amplification primers for targeting or GAP repair cassette DMAs. For all primers 

listed below, nucleotides in italics are homologous to the targeted sequence, while those in 
plain text are homologous to amplification cassettes. The Tet* cassette used for targeting 
cro-bio in DY330 was amplified from TnlO with primers: 

5TGGCGGTGATAATGG7TGCATGTACTMGGAGGTTGTATGCTCHGGGTYATCAA 

25 GAGGG (SEQ ID NO:21) and 

S'GGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCACTCGACATCTIGG'T 

TACCG (SEQ ID NO:22). The cat-sacB cassette used for replacing the tet gene in DY363 
was amplified from pK04 with primers: 

5'TGGCGGTGATAATGGTTGCATGTACTAAGGAGGTTGTATGCTGIGACGGAAGAT 
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CACTTCG (SEQ ID NO :23) and 

5'GGCGCTGCAAAAATTCTTTGTC GAA CA GGGTGTCTGGA rCCTGAGGTTCTT ATGG 
CTCTTG (SEQ ID NO:24). The araC-? B ^pe and araC-V BAD cre cassettes used for 
replacing the cat-sacB in ELI 1 were amplified from pBADflpe and pBADcre with 

5 primers: ^ A t. a 
5TGGCGGTGATAATGGTTGCATGTACTAAGGAGGTTGTATGAAGCGGCATGCATA 

ATGTGC (SEQ ID NO:25) and 

yGGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCCTGTGTCCTACTCAG 
GAGAGCGTTC (SEQ ID NO:26). The IRES-eGFPcre-FRT-kan -FSTcassette used for 

10 targeting the Eno2 locus was amplified from pIGCN21 with primers: 

yCGCTTCGCGGGACATAATTTCCGAAATCCCAGTGTGCTGTGAGCCAAGCTAlCGA 

ATTCCGCC (SEQ ID NO:27) and 

SVAGGCTCCAGGAGAATGAGATGTTCCCGCGTTCAGGCAAGCGCTATTCCAGAAG 
TAGTGAGGA (SEQ ID NO:28). The oligonucleotides used to target the flag cassette 
15 into the 5' end of the Sox4 gene were annealed and polymerase-extended using primers: 
S'GCGAGCGTGTGAGCGCGCGTGGGCGCCCGGCAAGCCGGGGCCATGGATIACA 

AGGATGACGACGATAAGGTACAACAGA (SEQ ID NO:29) and 
5'GGCCAGCAGAGCCTCAGTGTTCTCCGCGTTGTTGGTCTGTTGTACCTT ATCGTCG 

. . a r-r^r^r-nnrr f^VCt TT> NO:30Y 
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TCATCCTTGTAATCCATGGCCCCC (SEQ ID NO:30). 

The linear P BR322 derivative used to subclone the 25-kbp fragment from the 
modified Eno2 locus was amplified with primers: 

yCTCTCCATGCCTGTCTGGGTGAGGGTGGCCCAGGGGCGATGGCTATGAGAGAGG 

TCGACTTCTTAGACGTCAGGTGGCAC (SEQ ID NO:31) {Eno2-C-U) and 
GCAATGCAGAGAAGCCTTGTACTGGGATGACAGAGACGGAGGGGAAGAGGAGGCG 

GCCGCGATACGCGAGCGAACGTGA (SEQ ID NO:32) (W-C-Rl/2). The 

amplification primers for the other experiments were: 48-kbp modified fragment, 
S'GACTTCTATGACCTGTACGGAGGGGAGAAGTTTGCGACGTGACAGAGCTGGTCGT 

CGACTTCTTAGACGTCAGGTGGCAC (SEQ ID NO:33) (Eno2-C-Um and Eno2-C 
Rl/2; 60-kbp modified fragment, £wo2-C-L2/3/4 and 
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.OCCCCATACACGTAAATGTAGATAGMTCACACAGCATCACnCTAIGGA^OGC 
GGCCGCGATACGCGAGCGAACGTGA (SEQ ID NO:34) Qh*U% 80-kbp 

^^^^OCrAGAGCCUGGCCArCrAAGrGACAG^ 

GCCGCGATACGCGAGCGAACGTGA (SEQ ID NO:35) ***** P™- 

contained 5' region homologous to me Urge, sequence and 3' regions homologous to 
p B R 322.PCRp I odue,swe re punf i ecusm g aQiaexIIgele Xt ruetionlci t (Q 1 ageu)and 

digested with Of.nl to remove contaminated template. 

Prepare ofeUarocanp*** celts animation of recount, For BAG 
modulation overnight cultmes containing me BAG were grown from single colomes and 

foUowed by chilling on ice for 20 min. Cells men were centrifugod fc r 5 mm at 5,500 g 
« and washed wim 1.5 ml of ice-cold sterile water three tunes. Cells were ften 
upended in 50 pi of ice-cold atonic water and electtnporated. For BAC transformation, 
the induction step was omitted. 

Cell transformation waa performed by eiectroporation of 100-300 ng linear DNA 
tato 5 „ pi of ice-cold concpeten, ce«s in cuvetics (0, cm) uaing a Bio-Rud gene P „ ser set 
„ , 75 W. 25 uE with pulse contioller se« at 200 ohms. 1 ml of LB medium was added 
20 after dectroporation. Ce 1 l S weremcuba t eda«3rC fi >r,.5hrwimsha ki „gandspreadon 

appropriate selective or nonselective agar media. 

o/,— mi c. Modified BAC and the ^ 
were pruned uahrg cesium ch, ri de gradren* - — 



15 



25 



were punned using cesium «™»» s* rap DNA 

kbp subclone DNA was linearized by Notl digestion before nucrornj ection. BAC 
(Wntf) and 25-lcbp subclone DNA (2ug.nl) were microinjected into the pronuclei of 
CSH/HeN-Mtv" X C57Bl76Ncr) F 2 zygotes. Transgenic founders were subsequently 
identified by Southern analysis using a ere probe or by PGR using prnners 
5 ' CTGCTGGAAGATGGCGATTCTCG (SEQ ID NO:36) and 
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5 ' AAC AGC AGGAGCGGTGAGTC (SEQ ID NO:37) that flank the 3' insertional 
junction. 

Histochemical analysis of fi-galactosidase expression. Mice at 4 to 5 weeks of age 
were sacrificed in C0 2 and perfused with 4% paraformaldehyde in PBS (pH 7.3). The 
5 b rains,s P inalcordsandeye S werere m oveda 1 rdpostfixedfor3hr. Vibratome sections (20 

um) of brains were mounted on slides and used directly for X-gal staining or for 
immunocytochemistry. For spinal cords and eyes, cryostat sections (20 urn) were used 
that were made by cryoprotecting tissues in 30% sucrose in PBS overnight and embeddmg 
the tissues in freezing compound (OCT, Sakura). Before X- g al staining, samples on shdes 
10 were postfixed with 0.25% glutaraldehyde in PBS and briefly washed with rinse solution. 
(0 1 phosphate buffer P H7.3, 0.1% deoxycholic acid, 0.2% NP40 and 2mM M g Cl 2 ). X- g al 
staining was performed by incubating samples in staining buffer (2.5 mg/ml X-gal, 5mM 
potassium ferricyanide and 5 mM potassium ferrocyanide in staining buffer) for 2 hr at 
37°C followed by counterstaining with 0.25% eosin (Fisher). 
15 Immunocytochemistry. Immunostaining was carried out using the ABC Vectastain 

kit (Vector Labs) on 20 urn vibratome sections. Sections were blocked with PBS (pH7.3 
containing 0.2% Triton X-100, 1 .5% bovine serum albumin and 5% normal goat serum) at 
room temperature for 2 hr and incubated with primary Eno2 antibody, a poly clonal rabbit 
anti-^2 antiserum (Chemicon) at 1:100 dilution inPBS solution. After incubation wrth 
20 a secondary biotinylated antibody and the ABC reagent, peroxidase was reacted with 
0.05% diaminobenzidine tetrahydrochloride (DAB) and 0.003% hydrogen peroxide. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
mventionbelongs. Although methods and materials similar or equivalent to those 
25 described herein can be used in the practice or testing of the present invention, suitable 
methods and materials are described below. All publications, patent applications, 
patents, and other references mentioned herein are incorporated by reference m their 
entirety. In case of conflict, the present specification, including definitions, will 



PCT/US01/25507 

WO 02/14495 



92 

control. In addition, the materials, methods, and examples are illustrative only and not 
intended to be limiting. 
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We Claim: 

,. Ame.hodforperfonninghomolo^recomWna.ionofanudeoUdese^ce 

of interest in a host cell, comprising 

introducing a nucleic acid seduce encoding Exo, Beta and Can, or 
fcncnona, variant or homo,ogue thereof, operab.v tinted to a repress* promoter nrto 

a population of host cells; to 
introducing a polynucleotide comprising a nucleotide science homologous 

to nucleotide sequence ofintereat into me host cells; , Wramssio „ 
activating the tcpressib.e promoter in me cells, thereby tnductng me expressron 

ofExo, Beta and Gam, or the factional variant or homologue dtereof 

selecting a eel! from the population of host cells in which homologous 

acid sequence. 

2 The method of claim 1, wherein a, least one cell in 100,000 cells from the 
populationofhostcelUcontainsDNAinwhiohhomologousreoombinattonhas 

occurred. 

, The mettrod of claim 1, wherein at Las. one eel, in 1 ,000 cells from the 
population of host cells contain. DNA in which homologous recombination has 
occurred. 

4 The method of claim 1, wherein a, leas, one eel! in 100 cells from tire 
potion of cetis contains DNA m which homologous recombination haa ocemxed. 

5. ThemettrodofolaimS.whereinfhenucleotideseduenceoftaterestis 
intrachromosomal or exliaohiomosomal. 
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6 The method of claim 5, wherein the nucleotide sequence of interest is 
extrachromosomal and located on a bacterial artificial chromosome, a PI artificxal 
chromosome, plasmid, cosmid, or a yeast artificial chromosome. 

7. The method of claim 1, wherein the homologous recombination alters a 
function of a gene in the cell. 

8. The method of claim 1. wherein the nucleotide sequence homologous to the 
nucleotide sequence of interest is at least 30 base pairs in length. 

9. A method for inducing homologous recombination in a cell comprising a 

target nucleic acid, the method comprising:. 

introducing into a cell a first nucleic acid of interest comprising a homologous 
sequencetometargetnucleicacidofsufficientlengmtoinducehornologous 

recombination, wherein the cell comprises a nucleic acid encoding a single stranded 
DNA binding polypeptide that functions in double strand break repair homologous 
recombination, operably linked to a de-repressible promoter; and 

inducing sufficient expression of the nucleic acid encoding the single stranded 
DNA binding polypeptide to promote homologous recombination of the first nucleic 
acid of interest with the target nucleic acid; 

thereby inducing homologous recombination in the cell. 

10. The method of claim 9, wherein the first nucleic acid of interest is a single 
stranded DNA and or a DNA comprising a 3' overhang. 

11. The method of claim 10, wherein the homologous sequence is at least 70 
nucleotides in length. 

12. The method of claim 9, wherein the first nucleic acid of interest is a double 
stranded DNA comprising a homology arm. 
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13. The method of claim 12, wherein first nucleic acid of interest compnses a 
homology arm of at least 20 nucleotides in length. 

14 The method of claim 9, wherein the single stranded DNA binding 
polypeptide is selected from the group consisting of: lambda Beta, RecT, P22 Erf, and 
Rad52. 

15 The method of claim 9, wherein the single stranded DNA binding 
polypeptide is lambda Beta, and the cell further comprises a nucleic acid encoding Gam 
and a protein selected from the group consisting of Exo, Arabidopsis F12A21.16, 
cholera virus orf,B. subtilis YqaJ gene, Listeria phage All 8 gp47,B. subtilis phage 
SPPl gene 34.1, and African swine virus orf operably linked to the de-repressmle 
promoter. 

16 The method of claim 9, wherein the first nucleic acid sequence of interest is 
single stranded, and wherein the method further comprises introducing a second smgle- 
stranded DNA sequence, wherein the second single-stranded DNA sequence compnses 
more than 10 bp of complementary overlap at either the 5' or 3' terminus of the first 
nucleic acid of interest. 

• 17. The method of claim 9, wherein the cell is abacterial cell. 
18. The method of claim 9, wherein the cell is a eukaryotic cell. 



19. The 



20. The 



method of claim 9, wherein the eukaryotic cell is a mammalian cell, 
method of claim 19, wherein the eukaryotic cell is a stem cell. 
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" 21. The method of claim 9, wherein the de-re P ressible promoter is the PL 
promoter. 

22. The method of claim 9, wherein the firs, nucleic acid of interest encodes a 

« 

selectable marker. 

23. A method for inducing homologous recombination in a cefi comprising a 
target nucleic acid, the method comprising: 

contacting the cell with a single stranded DNA molecule and a single stranded 
DNAhmdingpolypepfidematfimcfionsmdoubles^dhresitrepahhom^ogou, 

recombination, wherein the single stranded DNA molecule comprises a sufficient 
number of nucleotide, homologous to the target nuc.eio acid to enable homologous 

recombination with the target nucleic acid; and 

inmtducingme single sfianded DNA module andthe surgle stranded bindntg 
polypeptide into die cel., whereby homotogous recombmauon of the singfe stranded 
DNA molecule with the target nucleic acid in the cell is induced. 

24 The method of claim 23, wherein the single stranded DNA molecules 
comprises a. leaa. about 70 nucleotides homologous to fire targe, nnclerc aotd 

25. The method of claim 23, wherein fire ce.1 is cnnttCedwifir a solufionttra, 
comprises about 0.001 pM ,o about 10 mM single suanded DNA. 

26. The method of claim 23, wherein the cell is contacted with a solution that 
comprises about 0.1 uM to about 1 mM single stranded DNA. 

27. Themethodofclaim23,whereinthecelliscontactedwithasolutionthat 
comprises about 1 uM to about 100 pM single stranded DNA. 
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28 Theme*odofc.aim23,wherei„ fll ec.l 1 iscon,ac t edw i <hasolu ttm « 
comprises about 0.01 pM to about 10 tnM single branded DNAbinding polypeptide. 

29 tt e method of 23, wherein tire coll is contacted with a solution that 
comprises about 0.1 uMto 1 mM single stranded DNA binding polypeptide. 

30 Tno method of 23, wheretn the oeU is contacted with a solution una. 
comprises about 1 uM to 100 pM sing.e stianded DNA binding po.ypepride. 

31 The method of claim 23, whereinfhe ssDNAhinding polypeptide is 
scfected fiom the group consisting of .ambda Beta, RecT, P22 Erf, and R ad52. 

32 The method of claim 23, wherein single stranded DNA comprises about 20 
nucleotide, to about 1,000 nucleotides homologous to the target nucleic acid sconce. 

33. A method for inducing homologous recombination in a host cell 

comprising a target nucleic acid, comprisingt , h „„„, 0 „ v 
introducing into me cefl a double shaded nucleic acd compnsmg a homology 

arm of sufficient long* to induce homofogous ruination, wherrin fte cell 

^ a nucleic acid encodhrg Beta operahly tinted to a 

inducing repression of Beta to promote homologous recombtnationbetween the 

double stranded nucleic acid and the targe, nucleic acid in the host cell. 

34. The method of claim 33, wherein the double stranded nucleic acid 
comprises a 5' overhang. 

35. Th.memodofclaim33,whereinmenucleicacidse q uence=„codingBem, 
functional variant or fragment thereof, is intrachromoaomal. 



or a 
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36 The method of claim 33, wherein the nucleio acid sequence encoding Beta, 
or a functional variant or fragment thereof, is extrachromosomal. 

37. The method of claim 33, wherein the cell further comprises a nucleic acid 
encoding gam operably linked to the de-repressible promoter. 

38. Thememodofclaim33,whereintheceUMhercom P risesanucleicacid 
encoding exo operably linked to the de-repressible promoter. 

39. Themehtodofclaim37,whereinthecellfurthercomprisesanucleicacid 
encoding gam operably linked to the de-repressible promoter. 



40. The 



41. The 

42. The 
defective strain. 



method of claim 33, wherein the host cell is a bacterial cell, 
method of claim 33, wherein the bacterial cell is E. coli. 



method of claim 33, wherein the E. coli is a host recombination 



43. The method of claim 42, wherein the host recombination defective stram is 
a rec A- strain or a rec A+ strain. 

44. The method of claim 33, farther comprising treating the host ceil to enhance 
macromolecular uptake. 

45 The method of claim 44, wherein the method to enhance macromolecuhu 
is selected torn tie group consist of: e.ectroporanon, calcium phosphate- 
DN A co-precipitation, DEAE-dextran-mediated transferor, poiybmne-mem^ 
Z»L. nlroiniechon, liposome fusion, ligation, protoplaa, ^ novated 
aHenovhus.menia.ed transfer, HVJ-lrposome mediated transfer, and hrohshes. 
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46 The method of claim 33, wherein the double stranded nucleic acid 
comprises a nucleic acid sequence encoding a positive selectable marker or a negative 
selectable marker. 

47. The method of claim 33, wherein the homology arm comprises at least 20 
nucleotides homologous to the target nucleic acid sequence. 

48. The method of claim 47, wherein the homology arm comprises at least 30 
nucleotides homologous to the target nucleic acid sequence. 

49 A method for cloning a nucleic acid molecule in a cell comprising: 
introducing into a host cell comprising a nucleic acid sequence encoding Exo, 
Beta and Gam, or a functional variant or homologue thereof, operably linked to a de- 
repressible promoter, a vector comprising a sufficient number of nucleotides 
homologous to a target nucleic acid to achieve homogologus recombmahon; 

activating the de-repressible promoter in the host cell, thereby inducmg the 
expression of Exo/Beta and Gam, or the functional variant or homologue thereof 

selecting ahost cell in which homologous recombination has occurred between 
the vector and a target nucleic acid sequence, thereby cloning the nucleic acid molecule. 

50. Abacterialcellcomprisingadefectivelambdaprophageofgenotype 
AcI857 A(cro-bioA). 

51. The bacterial cell of claim 47, further comprising a nucleic acid sequence 
encoding a selectable marker upstream of cI857. 

52 The bacterial cell of claim 51, wherein the nucleic encoding a selectable 
marker encodes a protein that confers resistance of the bacterial cell to an antibiotxc. 
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53 Uebac.enalcenofolataSO.ta.hercomprising.nMuciWepromo,. 
«J, «*». - a - * - sconce encoding a * specie recombutase 
operably linked to the inducible promoter 

54 The bacteria. ce„ of data 53, herein ,b. recombnrase is Hp Pipe, Cre, a 

rm e. fictional fragments or vanant of Flpe or a 
functional fragment or vanant of Hp, a functional tragm 

functional fragment or variant of Cre. 

55. The bacterial cell of c.aun 54, wherein the recomburase is rip, Hpe, or Cre,. 

56. The bacteria, eel of claun 47, further comprising a bacteria, artificial 
chromosome. 

, , • c< «twrpin the bacterial artificial chromosome 

57 The bacterial cell of claim 56, wherem trie oacxen 

comprises a nucleic acid sequence encoding a selectable marker. 

, n r i ■« <7 wherein the nucleic acid sequence encoding 

58 The bacterial cell of claim 57, wherein me 

recomized by the site specific 
the selectable marker is flanked by recombination sites recogmz 

recombinase. 

, 11 45 m « wherein the recombination sites are tap or 
59. The bacterial cell of claim 58, wherein uw 



F-RT sites. 



,0 Thebaetcria.cenofc.aimS^hesemnnebacteria.arnncia.cnromosome 
comprises at leas, one exon or one inuon of a mammauan gene. 
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61. A method of altering a eukaryotic nucleic acid sequence, comprising: 
introducing a nucleic acid of interest into a bacterial cell comprising an 
extrachromosomal element comprising least one intron or at least one exon of a 

eukaryotic nucleic acid sequence, 

wherein the nucleic acid of interest comprises a sufficient number of nucleoUdes 
homologous to the eukaryotic nucleic acid sequence to mediate homologous 
recombination, 

and wherein the bacterial cell comprises a de-re P ressible promoter operably 
linked to a nucleic acid sequence encoding a recombinase; and 

activating expression of the recombinase from the de-repressible promoter; 

wherein expression of the recombinase results in the nucleic add of interest 
undergoing homologous recombination with the eukaryotic nucleic acid sequence. 

62. The method of claim 61, wherein the recombinase is a single-stranded DNA 
binding protein. 

63. The method of claim 61, wherein the recombinase is a double-strand break 
repair recombinase. 

' 64, Themethodofclaim61,wheremmeDNAencodingtherecombinase 
encodes lambda Beta. 

■ 

65. The method of claim 64, wherein the DNA encoding the expressed 
recombinase further encodes lambda Exo. 

* 

66. The method of claim 64, wherein the DNA encoding the expressed 
recombinase further encodes lambda Gam. 
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67 The method of claim 61, wherein the extachromosomal element is a 
bacterial artificial chromosome, a yeast artificial chromosome, a PI artificial 
chromosome, a plasmid, or a cosmid. 

68 . to^*^**.^**^^^*^** 
mammalian nucleic acid sequence. 

A The method of Cairn 6,, whereto ft. nuc.eic acid of interest encodes an 
epitope tag. 

70 AmemodofaubclomngaDNAsequence,comprising: 
pro viding foe bacterial ceU comprising a nuOeic acid sequence encotag B*o, 
Beta and Qanr, or a mnctiona, variant or homologue thereof, operably Weed to a do- 
Beta ana oam, mo i e cule with a target nucleic acid 

repressive promoter, and further comprising a DNA molecule 

sequence located extrachromosomally; 

introducing into the baefcrial ceUa hnear plasmid vector compnsmg an ongm of 
^cation and a selectable marker, and comprising a nucleotide se q uence capable of 
Xomg^ogons^b^witomorargetnoe^oaoid^onmo 

indncmg me expression of the nucleic acid encoding Exo. Beta and Gam 

plasmid vector; and 

circutaizing me linear plasmid vector, thereby anbclomng me DNA. 

n . The method of claim 70, wherein the DNA comprises at least about 20 kb of 
the extrachromosomal DNA. 

,2. The memod of claim 70, wherenr the DNA compriaea a. leas, about 40 kb of 
the extrachromosomal DNA. 
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73. The method of claim 70, wherein the DNA comprises at least about 80 kb of 
the extrachromosomal DNA. 

74. The method of claim 70, wherein the extrachromosomal DNA is a bacterial 
artificial chromosome. 

» 

75 A mini-lambda comprising 

a lambda genome comprising a de-repressible promoter operably linked to a 
nucleic acid sequence encoding Beta, Exo, and Gam a nucleic acid sequence encodmg a 
selectable marker, an attP site and a nucleic acid sequence encoding an mtegrase, 
wherein the lambda genome lacks a replication origin. 

76. The mini lambda of claim 75, wherein the de-repressible promoter is PL. 

77. ThemM4ambdaofclaim75,furmercomprisinganucleicacidsequence 

encoding a selectable marker. 

78 Themim4ambdaofclaim75,whereintheselectablemarkerconfers 
antibiotic resistance or antibiotic sensitivity to a bacterial cell harboring the mmi- 
lambda. 

79. The mini-lambda of claim 75, wherein the mini-lambda comprises a lambda 
genome comprising a deletion of the genes from era through ea59. 

80 The mini-lambda of claim 75, further comprising a gene required for cell 
lysis, such that the mutation in lambda is deficient in inducing lysis of abacterial cell. 
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81 A method for producing a bacterial cell containing a chromosome 
comprising a de-repressible promoter operably linked to a nucleic acid sequence 
encoding lambda Bet, Gam, and Exo, comprising 

transforming a bacterial cell comprising an attB site on a chromosome of the 

bacterial with the mini-lambda of claim 75, 

thereby integrating the mini-lambda into the chromosome at the attB site, and 

producing the bacterial cell. 

82. The method of claim 81, wherein the bacterial cell further comprises a 
bacterial artificial chromosome (BAC). 



83. The 



84. The 



method of claim 81 , wherein the bacterial cell is an E. coli cell, 
method of claim 81, wherein the E. coli cell is aDHlOB cell. 
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FIG. 4 
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FIG. 6 
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SEQUENCE LISTING 

<110> THE GOVERNMENT OF THE UNITED STATES OF AMERICA, AS REPRESENTED. BY 
THE SECRETARY OF THE DEPARTMENT OF HEALTH AND HUMAN SERVICES, NATIONAL 
INSTITUTES OF HEALTH 

LEE , E- CHIANG 

COPELAND , NEAL G. 

JENKINS , NANCY A. 

COURT, DONALD L . 

YU, DAIGUAN 

ELLIS , HILARY M. 

<120> ENHANCED HOMOLOGOUS RECOMBINATION MEDIATED BY LAMBDA 
RECOMBINATION PROTEINS 

<130> 4239-59371 

<150> US 60/271,632 
<151> 2001-02-26 

<150> US 60/225,164 
<151> 2000-08-14 

<160> 37 

<170> Patentln version 3.1 

<210> 1 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 1 

tgtgacggaa gatcacttcg 20 



<210> 2 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 2 

accagcaata gacataagcg 20 



<210> 3 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 3 

ctcttgggtt atcaagaggg 20 



1 
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<210> 4 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 4 

actcgacatc ttggttaccg 20 

<210> 5 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 6 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 6 

agagttggta gctcttgatc 20 



<210> 7 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<210> 8 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 8 

tcagaagaac tcgtcaagaa g 21 



<400> 5 

cattcaaata tigtatccgct c 



21 



<400> 7 

tatggacagc aagcgaaccg 



20 



2 
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<210> 9 

<211> 50 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Homology segment 

<400> 9 

gtttgcgcgc agtcagcgat atccattttc gcgaatccgg agtgtaagaa 50 



<210> 10 

<211> 50 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Homology segment 

<400> 10 

ttcatattgt tcagcgacag cttgctgtac ggcaggcacc agctcttccg 50 



<210> 11 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Homology segment 
<400> 11 

aagtcgcggt cggaaccgta ttgcagcagc tttatcatct gccgctggac ggcgcacaaa 60 
tcgcgcttaa 70 



<210> 12 

<211> 60 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Homology segment 

<400> 12 

aacagacacc atggtgcacc tgactcctga ggagaagtct gccgttactg ccctgtgggg 60 



<210> 13 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 13 

ctgcaaggcg attaagttgg gtaacgccag ggttttcgtc aggtggcact ttcggg 56 

3 
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<210> 14 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 



<400> 14 

ctcagaagaa ctcgtcaaga agg 23 



<210> 15 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 15 

gtaggtacct cgagaatcgc catcttccag caggc 35 



<210> 16 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 16 

tcgaattttc tgcatccaat ttactgaccg tacacc 36 



<210> 11 
<211> 67 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 17 

gcaagtgtgt cgctgtcgac gagctcgcga gctcggacat gaggttgtct tagacgtcag 60 
gtggcac 67 



<210> 18 

<211> 69 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 18 

4 
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catagttaag ccagccccga cacccgccaa cacccgctga cgcgaacctc acgttaaggg 60 
attttggtc 69 



<210> 19 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 19 

gcaggatcca gtttgctcct ggagcgaca 29 



<210> 20 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 20 

tgcaggtcga ctctagagga tc 22 



<210> 21 

<211> 60 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 21 

tggcggtgat aatggttgca tgtactaagg aggttgtatg ctcttgggtt atcaagaggg 60 



<210> 22 

<211> 60 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Primer 
<400> 22 

ggcgctgcaa aaattctttg tcgaacaggg tgtctggatc actcgacatc ttggttaccg 60 



<210> 23 

<211> 61 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
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<400> 23 

tggcggtgat aatggttgca tgtactaagg aggttgtatg ctgtgacgga agatcacttc 60 



<210> 24 

<211> 61 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



Sequence 



<400> 24 

ggcgctgcaa aaattctttg tcgaacaggg tgtctggatc ctgaggttct tatggctctt 60 
a 61 



<210> 25 

<211> 60 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 25 

tggcggtgat aatggttgca tgtactaagg aggttgtatg aagcggcatg cataatgtgc 60 



<210> 26 

<211> 65 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 26 

ggcgctgcaa aaattctttg tcgaacaggg tgtctggatc ctgtgtccta ctcaggagag 60 
cgttc 65 



<210> 27 

<211> 63 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 27 

cgcttcgcgg gacataattt ccgaaatccc agtgtgctgt gagccaagct atcgaattcc 60 
gcc 63 



<210> 28 
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<211> 63 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 28 

gaggctccag gagaatgaga tgttcccgcg ttcaggcaag cgctattcca gaagtagtga 60 



gga 



<210> 29 

<211> 79 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



63 



<400> 29 

gcgagcgtgt gagcgcgcgt gggcgcccgg caagccgggg ccatggatta caaggatgac 60 
gacgataagg tacaacaga 79 



<210> 30 
<211> 79 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 30 

ggccagcaga gcctcagtgt tctccgcgtt gttggtctgt tgtaccttat cgtcgtcatc 60 
cttgtaatcc atggccccc 79 



<210> 31 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer 
<400> 31 

ctctccatgc ctgtctgggt gagggtggcc caggggcgat ggctatgaga gaggtcgact 60 
tcttagacgt caggtggcac 80 



<210> 32 

<211> 79 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Primer 
<400> 32 

gcaatgcaga gaagccttgt actgggatga cagagacgga ggggaagagg aggcggccgc 60 
gatacgcgag cgaacgtga 7 9 



<210> 33 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 33 

gacttctatg acctgtacgg aggggagaag tttgcgacgt gacagagctg gtcgtcgact 60 
tcttagacgt caggtggcac 80 



<210> 34 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 34 

gccccataca cgtaaatgta catagaatca cacagcatca cttctatgga tgcggcggcc 60 
gcgatacgcg agcgaacgtg a 81 



<210> 35 
<211> 79 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 35 

catccagtag aacttgggag tgaagctaga gccaaggcca tctaagtgac aggcggccgc 60 
gatacgcgag cgaacgtga 79 



<210> 36 

<211> 23 " 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 36 

ctgctggaag atggcgattc teg 23 
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<210> 37 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 37 

aacagcagga gcggtgagtc 



20 
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STATEMENT ACCOMPANYING SEQUENCE LISTING 

The sequence listing does not include matter that goes beyond the disclosure in the 
international application. 

The printout of the attached Sequence Listing is identical to the computer readable sequence 
listing on the enclosed computer disk. 



